The present invention provides mutants of the Green Fluorescent Protein (GFP) of Aequorea victoria. Specifically provided by the 
present invention are nucleic acid molecules encoding mutant GFPs, the mutant GFPs encoded by these nucleic acid molecules, vectors 
and host cells comprising these nucleic acid molecules, and kits comprising one or more of the above as components. The invention also 
provides methods for producing these mutant GFPs. The fluorescence of these mutants is observable using fluorescein optics, making the 
mutant proteins of the present invention available for use in techniques such as fluorescence microscopy and flow cytometry using standard 
FITC filter sets. In addition, certain of these mutant proteins fluoresce when illuminated by white light, particularly when expressed at 
high levels in prokaryotic or eukaryotic host cells or when present in solution or in purified form at high concentrations. The mutant GFP 
sequences and peptides of the present invention are useful in the detection of transfection, in fluorescent labeling of proteins, in construction 
of fusion proteins allowing examination of intracellular protein expression, biochemistry and trafficking, and in other applications requiring 
the use of reporter genes. 

Mutants of Green Fluorescent Protein 



BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention is in the fields of molecular and cellular biology. More 
particularly, the invention is directed to mutants of the genes encoding Green 
Fluorescent Protein (GFP) and the proteins encoded by these mutants. The 
mutant GFPs are used to allow detection of eukaryotic and prokaryotic cells 
transfected or transformed with extrinsic genes, and to label proteins of interest 
to facilitate their localization within viable cells. 

Related Art 

Transfection of Foreign Genes 

To study the function of a gene, a technique that is commonly employed 
is the transfer of the gene into a new cellular environment. This process, called 
"transfection," provides several advantages to the genetic scientist. For example, 
the cellular protein encoded by the gene can often be more easily studied by 
transferring the gene into a cell or organism that normally does not produce the 
protein, and then examining the effect of this protein on the host cell. The 
existence and function of regulatory genetic sequences (e.g., promoters, inhibitors 
and enhancers) may be elucidated by transfection of foreign genes into cells 
containing the regulatory sequences. The transfer of non-native or altered genes 
into a host cell also allows for large-scale production of the proteins encoded by 
the genes, a process upon which much of the current biotechnology industry is 
based. Transfection of plant embryos with foreign genes has provided genetically 
engineered plants that are more resistant to adverse environmental conditions or 
that are more nutritionally rich. Finally, gene transfer methods allow the 
introduction of new or mutated genes into whole organisms. This latter capability 
provides the opportunity for the construction of stable models of mammalian 
diseases, for large-scale production of proteins in the milk of transgenic lactating 
animals, and for the possibility of genetic therapy for certain diseases. 

A variety of techniques has been used to transfect non-native genes into 
cells (reviewed in Sambrook, J., et al., Molecular Cloning: A Laboratory Manual, 
2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press, pp. 
16.30-16.55 (1989); Watson, J.D., et al., Recombinant DNA, 2nd Ed., New York: 
W.H. Freeman and Co., pp. 213-234 (1992)). These techniques include biological 
methods such as the use of viruses (e.g., adenovirus or certain retroviruses for 
mammalian cells, baculovirus for insect cells and bacteriophages for bacterial cells) 
or bacteria (e.g., Agrobacterium for plant cells), chemical methods such as 
calcium phosphate precipitation, DEAE-dextran-mediated endocytosis or 
liposome-mediated transfection, and physical methods such as electroporation or 
direct microinjection. For transfection of mammalian cells, the techniques most 
commonly employed currently are virus-mediated transfection, lipofection and 
electroporation. 

Detection of Gene Transfer 

Regardless of the method used, however, simply attempting to transfect 
a cell does not guarantee that a majority (or even any) of the target cells will take 
up and/or express the exogenous DNA. Indeed, it has been suggested that the 
success rate of even the most optimal transfection techniques in achieving 
stable transfer of exogenous DNA is far less than 1% (Watson, J.D., et al., 
Recombinant DNA, 2nd Ed., New York: W.H. Freeman and Co., pp. 216, 218 
(1992)). Thus, it is usually critical to determine which target cells have received 
and/or incorporated the gene(s) being transfected, for which a number of 
methodologies have been used. 



Expression 

The most obvious of these methods is to simply examine the target cells 
for expression of the exogenous gene. In this method, the transfected cells are 
grown in vitro and assayed for the presence of the protein encoded by the 
transferred gene. These assays are usually accomplished using immunological 
techniques such as Western blotting, ELISA or RIA. This type of technique is 
only useful, however, if the protein is produced in relatively high amounts 
(generally at the microgram level or above) and if suitable antibodies are available, 
neither of which is the case for some transfected genes. 

In those cases where protein expression cannot be examined, incorporation 
of exogenous genes can be determined by assaying the target cells for production 
of the mRNAs corresponding to the transferred genes. One very common 
technique for this determination is Northern blotting (Alwine, J.C., et al., Proc. 
Natl. Acad. Sci. USA 74:5350-5354 (1977)), in which RNA molecules are isolated 
from cells, separated by gel electrophoresis and electroblotted onto a solid support 
(e.g., nitrocellulose or nylon). The solid support is then overlaid with 
radiolabelled cDNAs corresponding to the transfected gene, which hybridize on 
the solid support to their complementary mRNAs. After exposing the blot to 
photographic film, the samples containing the expressed transgene are easily 
determined. While this method is more sensitive than those directly measuring 
protein expression, Northern blotting still relies on actual expression of the gene 
by the target cells, which is not always the case. 

Selection 

Another method for determining gene transfer, alternative to directly 
measuring gene expression, is to examine the effect of the gene on the transfected 
cells. For example, some transfected genes will confer upon their host cells the 
ability to grow in selective culture media or under some other environmental stress 
which non-transfected cells cannot tolerate. Genes of interest are often engineered 
into sequences conferring, for example, antibiotic resistance upon the recipient 
cells. Transfectants with these constructs will thus carry not only the gene of 
interest but also the antibiotic resistance gene which allows them to grow in 
antibiotic-containing media. Since non-transfected cells will not possess this 
resistance, any cell able to grow in media containing antibiotic will contain the 
resistance marker (the so-called "selectable marker") and the transgene that is 
linked to it. Selectable markers commonly used in such an approach are the 
neomycin (neo), ampicillin (amp) and hygromycin (hyg) resistance genes. 

In the same way, selectable markers conferring on the transfected cells a 
metabolic advantage (e.g., ability to grow in nutrient-deficient media) have been 
used successfully. Examples of these types of selectable markers include 
thymidine kinase (Bacchetti, S., and Graham, F.L., Proc. Natl. Acad. Sci. USA 
74:1590-1594 (1977); Wigler, M., et al., Cell 11:223-232 (1977)) and xanthine-
guanine phosphoribosyltransferase (Mulligan, R.C., and Berg, P., Proc. Natl. 
Acad. Sci. USA 78:2072-2076 (1981)), which impart to their recipients the ability 
to grow, using metabolic rescue pathways encoded by the marker genes, in media 
that inhibit vital metabolic pathways in non-transfected cells. Again, any cells able 
to grow in such media will contain the transgene linked to the marker gene. 

Selection methods such as these often require weeks of culturing of the 
cells, continuously under selective pressure, to provide a relatively pure population 
of stable transfectants. Many uses of transfected cells, however, are conducted 
within hours of transfection, far too soon to determine transfection success using 
either the expression or selection methods described above. These types of 
applications are facilitated by a third approach: the use of "reporter genes". 

Reporter Genes 

Reporter genes are analogous to selectable markers in that they are co- 
transfected into recipient cells with the gene of interest, and provide a means by 
which transfection success may be determined. Unlike selectable markers, 
however, reporter genes typically do not confer any particular advantage to the 
recipient cell. Instead reporter genes, as their name implies, indicate to the 
observer (via some phenotypic activity) which cells have incorporated the reporter 
gene and thus the gene of interest to which it is linked. A number of reporter 
genes have been used, including those operating by biochemical or fluorescent 
mechanisms, each with its own advantages and limitations. 

Biochemical Reporter Genes 

Some commonly used reporter genes encode enzymes or other 
biochemical markers which, when active in the transfected cells, cause some 
visible change in the cells or their environment upon addition of the appropriate 
substrate. Two examples of this type of reporter sequence are the E. coli genes 
lacZ (encoding β-galactosidase or "β-gal") and gusA or uidA (encoding β-
glucuronidase or "β-glu"); the former is often used as a reporter gene in animal 
cells (Hall, C.V., et al., J. Mol. Appl. Genet. 2:101-109 (1983); Cui, C., et al., 
Transgenic Res. 3:182-194 (1994)), the latter in plant cells (Jefferson, R.A., Nature 
342:837-838 (1989); Watson, J.D., et al., Recombinant DNA, 2nd Ed., New York: 
W.H. Freeman and Co., pp. 281-282 (1992); Hull, G.A., and Devic, M., Meth. 
Mol. Biol. 49:125-141 (1995)). These bacterial sequences are useful as reporter 
genes because the recipient cells, prior to transfection, express extremely low 
levels (if any) of the enzyme encoded by the reporter gene. When transfected cells 
expressing the reporter gene are incubated with an appropriate substrate (e.g., X-
gal for β-gal or X-gluc for β-glu), a colored or fluorescent product is formed 
which can be detected and quantitated histochemically or fluorimetrically. 

Another often-used reporter gene is the bacterial gene encoding 
chloramphenicol acetyltransferase (CAT), which catalyzes the addition of acetyl 
groups to the antibiotic chloramphenicol (Gorman, C.M., et al., Mol. Cell. 
Biol. 2:1044-1051 (1982); Neumann, J.R., et al., BioTechniques 5:444-446 
(1987); Eastman, A., BioTechniques 5:730-732 (1987); Felgner, P.L., et al., Ann. 
N.Y. Acad. Sci. 772:126-139 (1995)). After transfection, recipient cells are lysed 
and the lysates are incubated with radiolabeled chloramphenicol and an acetyl 
donor such as acetyl-CoA, or with unlabeled chloramphenicol and radiolabeled 
acetyl-CoA (Sleigh, M.J., Anal. Biochem. 156:251-256 (1986)). If expressed in 
the cells, CAT transfers acetyl groups to chloramphenicol, which is then easily 
assayed by chromatographic techniques, thereby giving an indication of the 
incorporation of the co-transfected gene of interest by the recipient cells. 

Using reporter genes in this way, populations of cells, or even single cells, 
can be rapidly assayed for their incorporation of the exogenous gene linked to the 
reporter gene. Since they do not rely directly on the expression of the gene of 
interest, assays of transfection success using reporter genes are usually simpler and 
more sensitive than those measuring mRNA or protein production from the 
transgene (Watson, J.D., et al., Recombinant DNA, 2nd Ed., New York: W.H. 
Freeman and Co., p. 155 (1992)). However, the use of reporter genes is severely 
limited in that it usually requires sacrifice (fixation) of the cells prior to assay, and 
therefore cannot be used for assaying living cells or cultures. Thus, alternative 
means for determining the incorporation of the transgene in viable cells have been 
developed. 

Fluorescent Reporter Genes 

Reporter genes that can be assayed in viable cells are rapidly gaining widespread 
use; one example is the fluorescence-based reporter genes. These genes encode proteins which are 
either naturally fluorescent or which convert a substrate from nonfluorescent to 
fluorescent. Assays using this type of reporter gene are non-destructive and, 
owing to the availability of sophisticated fluorescence detection systems, are often 
more sensitive than biochemical reporter gene assays. 

One example of a fluorescence reporter gene is the luciferin-luciferase 
system (Bronstein, I., et al., Anal. Biochem. 219:169-181 (1994)). This system 
utilizes the gene for luciferase, an ATPase enzyme isolated from fireflies (Gould, 
S.J., and Subramani, S., Anal. Biochem. 175:5-13 (1988)) and other beetles 
(Wood, K.V., et al., J. Biolumin. Chemilumin. 4:289-301 (1989)), or from certain 
bioluminescent bacteria (Stewart, G.S., and Williams, P., J. Gen. Microbiol. 
138:1289-1300 (1992); Langridge, W., et al., J. Biolumin. Chemilumin. 9:185-
200 (1994)). For use as a reporter gene, the luciferase gene is placed into a vector 
also containing the gene of interest, or separate vectors containing the luciferase 
gene and the gene of interest are mixed together. Cells are then transfected with 
the vector(s) and treated with the luciferase substrate luciferin which is rendered 
luminescent (and impermeant) intracellularly by the action of the luciferase. Cells 
containing the luciferase gene, and thus the gene of interest linked to it, can then 
be rapidly and sensitively observed using luminescence detectors such as 
luminometers. 

To provide a further increase in sensitivity, attempts have been made to 
use genes from certain cyanobacteria which encode naturally fluorescent 
phycobiliproteins such as phycoerythrin and phycocyanin. These proteins are 
among the most highly fluorescent known (Oi, V.T., et al., J. Cell Biol. 93:981-
986 (1982)), and systems have been developed that are able to detect the 
fluorescence emitted from as little as one phycobiliprotein molecule (Peck, K., et 
al., Proc. Natl. Acad. Sci. USA 86:4087-4091 (1989)). Phycobiliproteins also 
have the advantage of being naturally fluorescent, thus eliminating the time-
consuming steps of the addition of exogenous substrates for their detection as is 
required for luciferase and biochemical reporter genes. However, the 
phycobiliproteins have proven extremely difficult to engineer into gene constructs 
in such a way as to maintain their fluorescence (Heim, R., et al., Proc. Natl. Acad. 
Sci. USA 91:12501-12504 (1994)), and thus are not commonly used as reporter 
genes in assaying the transfection of mammalian cells. 

Thus, the ideal reporter gene would encode a naturally fluorescent protein 
(for ease of use following transfection) that is highly fluorescent (for increased 
sensitivity) and easily engineered (for maintenance of fluorescence). Such a 
system has recently been developed, using the Green Fluorescent Proteins (GFPs) 
isolated from certain marine cnidarians. 

GFP 

Overview 

GFPs are involved in bioluminescence in a variety of marine invertebrates, 
including jellyfish such as Aequorea spp. (Morise, H., et al., Biochemistry 
13:2656-2662 (1974); Prendergast, F.G., and Mann, K.G., Biochemistry 17:3448-
3453 (1978); Ward, W.W., Photochem. Photobiol. Rev. 4:1-57 (1979)) and the sea 
pansy Renilla reniformis (Ward, W.W., and Cormier, M.J., Photochem. 
Photobiol. 27:389-396 (1978); Ward, W.W., et al., Photochem. Photobiol. 
31:611-615 (1980)). The GFP isolated from Aequorea victoria has been cloned 
and the primary amino acid structure has been deduced (Figure 1; Prasher, D.C., 
et al., Gene 111:229-233 (1992)) (SEQ ID NOs:1, 2). The chromophore of A. 
victoria GFP is a hexapeptide composed of amino acid residues 64-69 in which 
the amino acids at positions 65-67 (serine, tyrosine and glycine) form a 
heterocyclic ring (Prasher, D.C., et al., Gene 111:229-233 (1992); Cody, C.W., 
et al., Biochemistry 32:1212-1218 (1993)). Resolution of the crystal structure of 
GFP has shown that the chromophore is contained in a central α-helical region 
surrounded by an 11-stranded β-barrel (Ormo, M., et al., Science 273:1392-1395 
(1996); Yang, F., et al., Nature Biotech. 14:1246-1251 (1996)). Upon 
purification, native GFP demonstrates an absorption maximum at 395 nanometers 
(nm) and an emission maximum at 509 nm (Morise, H., et al., Biochemistry 
13:2656-2662 (1974); Ward, W.W., et al., Photochem. Photobiol. 31:611-615 
(1980)) with exceptionally stable and virtually non-photobleaching fluorescence 
(Chalfie, M., et al., Science 263:802-805 (1994)). 

While GFP has been used as a fluorescent label in protein localization and 
conformation studies (Heim, R., et al., Proc. Natl. Acad. Sci. USA 91:12501-12504 
(1994); Yokoe, H., and Meyer, T., Nature Biotech. 14:1252-1256 (1996)), it has 
gained increased attention in the field of molecular genetics since the 
demonstration of its utility as a reporter gene in transfected prokaryotic and 
eukaryotic cells (Chalfie, M., et al., Science 263:802-805 (1994); Heim, R., et al., 
Proc. Natl. Acad. Sci. USA 91:12501-12504 (1994); Wang, S., and Hazelrigg, T., 
Nature 369:400-403 (1994)). GFP has also been used in fluorescence resonance 
energy transfer studies of protein-protein interactions (Heim, R., and Tsien, R.Y., 
Curr. Biol. 6:178-182 (1996)). Since GFP is naturally fluorescent, exogenous 
substrates and cofactors are not necessary for induction of fluorescence, thus 
providing GFP an advantage over the biochemical, luminescent and other 
fluorescent reporter genes described above. Visualization of GFP fluorescence 
does not require the fixation steps necessary with biochemical reporters such as 
β-gal and β-glu, nor does it require extraction from the cell prior to assay as may 
be required with luciferase; thus, GFP is suitable for use in procedures requiring 
continued viability of transfected cells. In addition, since the GFP cDNA 
containing the complete coding region is less than 1 kilobase in size (Prasher, 
D.C., et al., Gene 111:229-233 (1992)), it is easily manipulated and inserted into 
a variety of vectors for use in creating stable transfectants (Chalfie, M., et al., 
Science 263:802-805 (1994)). 

Despite these advantages, however, the use of wildtype GFP has a few 
limitations. For example, the excitation and emission maxima of wildtype GFP are 
not within the range of wavelengths of standard fluorescence optics (at which GFP 
demonstrates relatively low quantum yield (i.e., low intensity of fluorescence)). 
In addition, GFP shows low efficiency of transcription in mammalian cells upon 
transfection and is packaged into low-solubility inclusion bodies in bacteria (thus 
providing difficulty in purification). These limitations have been overcome to a 
limited extent via the introduction of selected point mutations into the sequence 
of wildtype GFP. 

GFP Mutants 

One of the earliest mutation studies of GFP, in which the tyrosine residue 
at position 66 in the wildtype protein ("wt-GFP") was replaced with a histidine 
residue, resulted in a mutant protein which fluoresced blue instead of green when 
excited with ultraviolet (UV) light (Heim, R., et al., Proc. Natl. Acad. Sci. USA 
91:12501-12504 (1994)). This mutant protein not only provided a capacity for two 
distinguishable wavelengths for use in studies comparing independent proteins and 
gene expression events, but also demonstrated that single point mutations in GFP 
could induce drastic changes in the photochemistry of the protein. Three other 
sets of specific point mutations have been shown to increase the excitation and 
emission maxima of GFP such that they fall well within the range of standard 
fluorescein optics (Ehrig, T., et al., FEBS Letts. 367:163-166 (1995); Delagrave, 
S., et al., Bio/Technology 13:151-154 (1995); Heim, R., and Tsien, R.Y., Curr. Biol. 
6:178-182 (1996)), thus permitting the use of GFP with standard laboratory 
fluorescence detection systems. The problem of low quantum yield by wt-GFP 
has been partially addressed by mutating the serine residue at position 65 to a 
threonine ("S65T"), either without (Heim, R., et al., Proc. Natl. Acad. Sci. USA 
91:12501-12504 (1994)) or with (Cormack, B., et al., Gene 173:33-38 (1996)) a 
concomitant mutation at position 64, or by mutating other residues in the non-
chromophore region (Crameri, A., et al., Nature Biotech. 14:315-319 (1996)). 
The S65T mutation also appears to improve the rate of fluorophore formation in 
transfected cells by approximately four-fold over wt-GFP, thus allowing earlier 
and more sensitive detection of transfection with this mutant than with wt-GFP 
(Heim, R., et al., Proc. Natl. Acad. Sci. USA 91:12501-12504 (1994)). By 
combining the S65T mutation with a mutation at position 64 replacing 
phenylalanine with leucine, approximately 90% of the mutant GFP expressed in 
bacteria is soluble, thus improving protein purification and yields (Cormack, B., 
et al., Gene 173:33-38 (1996)). Another series of mutations results in a mutant 
fusion GFP consisting of linked blue- and green-fluorescing proteins which have 
proven useful in studies of protein localization, targeting and processing (Heim, 
R., and Tsien, R.Y., Curr. Biol. 6:178-182 (1996)). Analogously, chimeric 
constructs comprising GFP linked to other proteins have been used in studies of 
ion channel expression and function (Marshall, J., et al., Neuron 14:211-215 
(1995)), and in organelle targeting studies where they have provided a means for 
selectively and distinctively labeling the organelles of living cells (Rizzuto et al., 
Curr. Biol. 6:183-188 (1996)). Finally, by combining the S65T mutation with 
other mutations throughout the nonchromophore regions of the wt-GFP gene, a 
"humanized" mutant GFP (SEQ ID NOs:3, 4) has been produced that not only 
shows a significant increase in fluorescence intensity and rate of fluorophore 
formation over wt-GFP (via the S65T mutation) but also demonstrates a 22-fold 
increased expression efficiency in mammalian cells (Evans, K., et al., FOCUS 
18(2):40-43 (1996); Zolotukhin, S., et al., J. Virol. 70:4646-4654 (1996)). This 
humanization was achieved via 92 base substitutions (in 88 codons) to the wt-GFP 
gene which were amino acid-conservative and which were made to provide a 
pattern of codon usage more closely resembling that of mammalian cells, as 
opposed to the jellyfish codon patterns found in the wt-GFP gene which are less 
efficiently translated in mammalian cells. A summary of these GFP chromophore 
mutants is presented in Table 1. 
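
The distinction drawn above between base substitutions and codons (92 substitutions in 88 codons, all amino acid-conservative) can be illustrated with a short Python sketch. It is offered only as an illustration: the two sequences are made-up toy strings, not the wt-GFP or humanized GFP sequences, and the function name compare_coding_sequences is hypothetical. The sketch counts differing bases, counts the codons that contain at least one difference, and reports any codon whose encoded amino acid changes under the standard genetic code.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def compare_coding_sequences(original, recoded):
    """Return (base_substitutions, codons_changed, non_synonymous_positions)."""
    if len(original) != len(recoded) or len(original) % 3 != 0:
        raise ValueError("sequences must be in-frame and of equal length")
    base_substitutions = 0
    codons_changed = 0
    non_synonymous = []  # 1-based codon positions whose amino acid changed
    for start in range(0, len(original), 3):
        old = original[start:start + 3]
        new = recoded[start:start + 3]
        differences = sum(a != b for a, b in zip(old, new))
        if differences:
            base_substitutions += differences
            codons_changed += 1
            if CODON_TABLE[old] != CODON_TABLE[new]:
                non_synonymous.append(start // 3 + 1)
    return base_substitutions, codons_changed, non_synonymous


# Toy example (hypothetical sequences): Met-Leu-Lys-Gly recoded synonymously.
toy_original = "ATGTTAAAAGGA"
toy_recoded = "ATGCTGAAGGGC"
print(compare_coding_sequences(toy_original, toy_recoded))  # (4, 3, [])
```

In the toy example the leucine codon differs at two bases (TTA to CTG), so the number of base substitutions exceeds the number of changed codons, which is the same relationship as 92 substitutions falling in 88 codons. The empty list indicates that every change is amino acid-conservative.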

Table 1. GFP Chromophore Mutants.

                    Amino Acid Residue Number:
Mutant              64      65      66      Reference¹

(Wildtype)          Phe     Ser     Tyr     Prasher et al., 1992
GreenLantern-1      Phe     Thr     Tyr     Evans et al., 1996
Humanized GFP       Phe     Thr     Tyr     Zolotukhin et al., 1996
Y66H                Phe     Ser     His     Heim et al., 1994
Y66W                Phe     Ser     Trp
Y66F                Phe     Ser     Phe
RSGFP1              Gly     Ser     Tyr     Delagrave et al., 1995
RSGFP2              Leu     Leu     Tyr
RSGFP3              Gly     Cys     Tyr
RSGFP4              Met     Gly     Tyr
RSGFP6              Val     Ala     Tyr
RSGFP7              Leu     Cys     Tyr
S65A                Phe     Ala     Tyr     Heim et al., 1996
S65L                Phe     Leu     Tyr
S65C                Phe     Cys     Tyr
S65T                Phe     Thr     Tyr
GFPmut1             Leu     Thr     Tyr     Cormack et al., 1996

¹ See preceding text for full citations. 
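
For readers who wish to work with Table 1 programmatically, the following Python sketch transcribes the residues at positions 64 to 66 and derives conventional substitution names (for example, F64L) by comparison with the wildtype row. The dictionary and function names are hypothetical, and the sketch covers only the three chromophore-region positions listed in the table; it says nothing about changes elsewhere in a given mutant, such as the silent codon changes in the humanized GFP.

```python
# Three-letter to one-letter codes for the residues appearing in Table 1.
ONE_LETTER = {
    "Ala": "A", "Cys": "C", "Gly": "G", "His": "H", "Leu": "L", "Met": "M",
    "Phe": "F", "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

POSITIONS = (64, 65, 66)
WILDTYPE = ("Phe", "Ser", "Tyr")

# Residues at positions 64, 65 and 66, transcribed from Table 1.
TABLE_1 = {
    "GreenLantern-1": ("Phe", "Thr", "Tyr"),
    "Humanized GFP": ("Phe", "Thr", "Tyr"),
    "Y66H": ("Phe", "Ser", "His"),
    "Y66W": ("Phe", "Ser", "Trp"),
    "Y66F": ("Phe", "Ser", "Phe"),
    "RSGFP1": ("Gly", "Ser", "Tyr"),
    "RSGFP2": ("Leu", "Leu", "Tyr"),
    "RSGFP3": ("Gly", "Cys", "Tyr"),
    "RSGFP4": ("Met", "Gly", "Tyr"),
    "RSGFP6": ("Val", "Ala", "Tyr"),
    "RSGFP7": ("Leu", "Cys", "Tyr"),
    "S65A": ("Phe", "Ala", "Tyr"),
    "S65L": ("Phe", "Leu", "Tyr"),
    "S65C": ("Phe", "Cys", "Tyr"),
    "S65T": ("Phe", "Thr", "Tyr"),
    "GFPmut1": ("Leu", "Thr", "Tyr"),
}


def substitutions(mutant):
    """List substitutions at positions 64-66 relative to wildtype GFP."""
    residues = TABLE_1[mutant]
    return [
        f"{ONE_LETTER[wt]}{position}{ONE_LETTER[new]}"
        for position, wt, new in zip(POSITIONS, WILDTYPE, residues)
        if wt != new
    ]


print(substitutions("GFPmut1"))  # ['F64L', 'S65T']
print(substitutions("RSGFP4"))   # ['F64M', 'S65G']
```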



Despite some success in overcoming certain of the above-described 
limitations of GFPs, the sensitivity of GFP as a reporter gene (measured as 
percentage of positive cells) is not as high as that of standard biochemical reporter 
genes such as β-gal (Evans, K., et al., FOCUS 18(2):40-43 (1996)). In addition, 
the use of GFP as a reporter gene or a protein tag requires the use of fluorescent 
excitation and emission optics, which increases user expense and which is more 
technically challenging than the use of visible or white light optics often used with 
standard reporters such as β-gal. Thus, a need currently exists for additional GFP 
variants which are more highly fluorescent, humanized, rapidly expressed in 
mammalian cells, capable of being observed using standard white light optics, and 
which provide an increased level of sensitivity. 



SUMMARY OF THE INVENTION 



It is thus an object of the present invention to provide mutant GFP cDNAs 
and proteins. In one aspect, the invention relates to such mutant GFP cDNAs 
which, when transfected into prokaryotic (e.g., bacterial) or eukaryotic (e.g., 
mammalian) cells, increase the sensitivity of detection (measured as percentage or 
number of positive cells). The present invention thus provides nucleic acid 
molecules encoding mutant GFPs, wherein the mutant GFPs have an amino acid 
sequence comprising an amino acid residue lacking an aromatic ring structure at 
position 64 and an amino acid residue having a side chain no longer than two 
carbon atoms in length at position 65. Preferably, (a) if the residue at position 64 
is leucine then the residue at position 65 is not cysteine or threonine; (b) if the 
residue at position 64 is valine then the residue at position 65 is not alanine; (c) if 
the residue at position 64 is methionine then the residue at position 65 is not 
glycine; and (d) if the residue at position 64 is glycine then the residue at position 
65 is not cysteine. The invention is particularly directed to such nucleic acid 
molecules encoding mutant GFPs wherein the amino acid residue at position 64 
is alanine, valine, leucine, isoleucine, proline, methionine, glycine, serine, 
threonine, cysteine, asparagine, glutamine, aspartic acid or glutamic acid, 
most preferably cysteine or methionine. The invention is also particularly directed 
to such nucleic acid molecules encoding mutant GFPs wherein the amino acid 
residue at position 65 is alanine, glycine, threonine, cysteine, asparagine or 
aspartic acid, most preferably alanine. In particular, the invention provides 
nucleic acid molecules encoding mutant GFPs wherein the amino acid at position 
64 is cysteine or methionine and the amino acid at position 65 is alanine, and 
nucleic acid molecules encoding mutant GFPs having an amino acid sequence as 
set forth in either SEQ ID NO: 5 or SEQ ID NO: 6. 
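
The residue constraints recited in the preceding paragraph can be restated as a simple membership check. The Python sketch below is one illustrative reading of those constraints and is not part of the specification: the set names and the function in_preferred_set are hypothetical, the position-64 list is taken from the residues enumerated above, and the excluded pairs are provisos (a) through (d).

```python
# Residues enumerated above for position 64 (each lacks an aromatic ring).
POSITION_64_RESIDUES = {
    "Ala", "Val", "Leu", "Ile", "Pro", "Met", "Gly", "Ser",
    "Thr", "Cys", "Asn", "Gln", "Asp", "Glu",
}

# Residues enumerated above for position 65 (short side chains).
POSITION_65_RESIDUES = {"Ala", "Gly", "Thr", "Cys", "Asn", "Asp"}

# Provisos (a)-(d): (residue 64, residue 65) pairs that are excluded.
EXCLUDED_PAIRS = {
    ("Leu", "Cys"), ("Leu", "Thr"),  # (a)
    ("Val", "Ala"),                  # (b)
    ("Met", "Gly"),                  # (c)
    ("Gly", "Cys"),                  # (d)
}


def in_preferred_set(residue_64, residue_65):
    """Check a (64, 65) residue pair against the lists and provisos above."""
    return (
        residue_64 in POSITION_64_RESIDUES
        and residue_65 in POSITION_65_RESIDUES
        and (residue_64, residue_65) not in EXCLUDED_PAIRS
    )


print(in_preferred_set("Cys", "Ala"))  # True
print(in_preferred_set("Leu", "Thr"))  # False, excluded by proviso (a)
print(in_preferred_set("Phe", "Ser"))  # False, the wildtype residues
```

The pairs Cys/Ala and Met/Ala, named above as most preferred, both pass the check, while the wildtype pair Phe/Ser does not.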

In additional aspects, the invention provides mutant GFPs encoded by any 
of the above-described nucleic acid molecules, vectors (particularly expression 
vectors) comprising these nucleic acid molecules, host cells (prokaryotic or 
eukaryotic (including mammalian)) comprising these nucleic acid molecules or 
vectors, and compositions comprising plasmid pGreenLantern-2/A1 or plasmid 
pGreenLantern-2/A4. The invention also provides methods for producing a 
mutant GFP, comprising culturing the above-described host cells under conditions 
favoring the production of a mutant GFP and isolating the mutant GFP from the 
host cell. The invention also provides mutant GFPs produced by these methods, 
particularly wherein the mutant GFPs emit fluorescent light when illuminated with 
white light. The invention also relates to compositions comprising the above- 
described mutant GFPs. 

The invention is further directed to kits for transfecting a host cell with the 
nucleic acid molecules encoding the present mutant GFPs, such kits comprising 
at least one container containing a nucleic acid molecule encoding a mutant GFP 
such as those described above, which preferably comprises plasmid 
pGreenLantern-2/A1 or plasmid pGreenLantern-2/A4. These kits of the invention 
may optionally further comprise at least one additional container containing a 
reagent, preferably comprising a liposome and most preferably 
LIPOFECTAMINE™, for delivering a mutant GFP nucleic acid molecule into a 
host cell. 

The invention is further directed to kits for labeling a polypeptide with the 
present mutant GFPs, such kits comprising at least one container containing a 
mutant GFP such as those described above, preferably a mutant GFP having an 
amino acid sequence as set forth in SEQ ID NO:5 or SEQ ID NO:6. These kits 
of the invention may optionally further comprise at least one additional container 
containing a reagent for covalently linking this mutant GFP to the target 
polypeptide. 

The fluorescence of all of the GFP mutants provided by the present 
invention is observable with fluorescein optics, making these mutant proteins 
amenable to use in techniques such as fluorescence microscopy and flow 
cytometry using standard FITC filter sets. In addition, the fluorescence of certain 
of the present GFP mutants, particularly those having amino acid sequences as set 
forth in SEQ ID NOs: 5 and 6, is visible using standard white light optics (e.g., 
incandescent or fluorescent indoor lighting, or sunlight). The nucleic acid 
molecules and mutant GFPs provided by the present invention thus contribute 
improved tools for detection of transfection, for fluorescent labeling of proteins, 
for construction of fusion proteins allowing examination of intracellular protein 
expression, biochemistry and trafficking, and for other applications requiring the 
use of reporter genes. 

Other preferred embodiments of the present invention will be apparent to 
one of ordinary skill in light of the following drawings and description of the 
invention, and of the claims. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a depiction of the nucleotide (SEQ ID NO:1) and deduced 
amino acid (SEQ ID NO:2) sequences of A. victoria Green Fluorescent Protein 
cDNA (after Prasher, D.C., et al., Gene 111:229-233 (1992)). 

Figure 2 is a depiction of the nucleotide (SEQ ID NO:3) and deduced 
amino acid (SEQ ID NO:4) sequences of humanized A. victoria Green 
Fluorescent Protein cDNA (after Zolotukhin, S., et al., J. Virol. 70:4646-4654 
(1996)). 

Figure 3 is a depiction of the amino acid sequence (SEQ ID NO:5) of the 
A1 GFP mutant. 

Figure 4 is a depiction of the amino acid sequence (SEQ ID NO:6) of the 
A4 GFP mutant. 

Figure 5 is a structural map of plasmid pGreenLantern-1. 

Figure 6 is a structural map of plasmid pGreenLantern-2. 

Figure 7 is a fluorescence photomicrograph of CHO-K1 cells viewed 24 
hours after transfection with the A1 GFP mutant (plasmid pGreenLantern-2/A1). 

Figure 8 is a fluorescence photomicrograph of CHO-K1 cells viewed 24 
hours after transfection with the A4 GFP mutant (plasmid pGreenLantern-2/A4). 

Figure 9 is a fluorescence photomicrograph of negative control CHO-K1 
cells viewed 24 hours after transfection with the pGreenLantern-2 backbone. 

Figure 10 is a bar graph demonstrating the fluorescence of CHO-K1 cells 
determined by flow cytometry 24 hours after transfection with various GFP 
mutants. 

Figure 11 is a bar graph demonstrating the fluorescence of CHO-K1 cells 
determined by flow cytometry 48 hours after transfection with various GFP 
mutants. 

Figure 12 is a structural map of plasmid pProEX HTb. 

DETAILED DESCRIPTION OF THE INVENTION 

Overview 

The present invention provides nucleic acid molecules encoding mutant 
GFPs, vectors and host cells comprising these nucleic acid molecules, the mutant 
GFP polypeptides, and methods for producing mutant GFPs. Although specific 
plasmids, vectors, promoters, selection methods and host cells are disclosed and 
used herein and in the Examples, other promoters, vectors, selection methods and 
host cells, both prokaryotic and eukaryotic, are well-known to one of ordinary 
skill in the art and may be used to practice the present invention without departing 
from the scope of the invention or any of the embodiments thereof. 

In the present invention, GFPs with selective point mutations at amino acid 
positions 64 and 65 have been constructed and analyzed. In general, it has been 
discovered in the present invention that when the amino acid residue at position 
64 (phenylalanine in wt-GFP) is mutated to an amino acid lacking an aromatic ring 
(e.g., alanine, valine, leucine, isoleucine, proline, methionine, glycine, serine, 
threonine, cysteine, asparagine, glutamine, aspartic acid, glutamic acid, lysine, 
arginine or histidine), an increase in fluorescence quantum yield is observed. 
Increased fluorescence intensity is also observed when the amino acid residue at 
position 65 (serine in wt-GFP) is mutated to an amino acid having a side chain 
consisting of no more than two carbon atoms (e.g., alanine, glycine, threonine, 
cysteine, asparagine or aspartic acid); such substitutions induce a significant 
"red-shift" in excitation maximum from ultraviolet to visible blue wavelengths 
and a single excitation maximum instead of the dual excitation maximum of the 
wildtype protein. Together, these general results indicate that in order to construct GFP 
mutants with a dramatic increase in fluorescence intensity from wt-GFP, either 
position 64 or position 65 should contain a reactive amino acid, although 
particular amino acids appear to be preferred at each position as described below. 
Furthermore, it has been unexpectedly discovered that several of the mutant GFPs 
of the present invention, unlike those previously known in the art, will emit 
fluorescence when illuminated by white light (e.g., incandescent or fluorescent 
indoor lighting, or sunlight). 
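
By way of illustration only, the substitution criteria described above may be 
expressed as a simple lookup. The following Python sketch is not part of the 
disclosed methods: the two residue sets are copied from the examples listed in 
the preceding paragraph, and the helper name and its return format are 
hypothetical. A residue absent from a list is merely "not listed"; the sketch 
makes no prediction about it. 

```python
# Residues named in the text as examples for position 64 (listed there as
# lacking an aromatic ring) and for position 65 (short side chains).
LISTED_FOR_64 = {
    "Ala", "Val", "Leu", "Ile", "Pro", "Met", "Gly", "Ser", "Thr",
    "Cys", "Asn", "Gln", "Asp", "Glu", "Lys", "Arg", "His",
}
LISTED_FOR_65 = {"Ala", "Gly", "Thr", "Cys", "Asn", "Asp"}
WILDTYPE = {64: "Phe", 65: "Ser"}


def listed_substitutions(res64, res65):
    """Report, per position, whether the residue is a listed substitution.

    Hypothetical helper: True means the residue differs from wildtype and
    appears in the corresponding example list; False makes no statement
    about residues the text does not mention.
    """
    return {
        64: res64 != WILDTYPE[64] and res64 in LISTED_FOR_64,
        65: res65 != WILDTYPE[65] and res65 in LISTED_FOR_65,
    }


print(listed_substitutions("Met", "Ala"))  # A4: both positions substituted
print(listed_substitutions("Phe", "Thr"))  # S65T: only position 65 substituted
```

Satisfying both lists does not by itself indicate a bright mutant: the 
Thr64/Cys65 combination (mutant A8) meets both criteria yet is reported in 
Table 3 as equivalent to wildtype, so candidates still have to be tested as 
described in the Examples. 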

Accordingly, in the present invention, specific mutations are introduced 
into positions 64 and 65 of the wt-GFP cDNA sequence (SEQ ID NO:1). 
Alternatively, increased expression of the present mutant GFPs may be obtained 
by introducing the preferred mutations into a humanized GFP gene such as that 
described previously (SEQ ID NO:3) (Evans, K., et al., FOCUS 18(2):40-43 
(1996); Zolotukhin, S., et al., J. Virol. 70:4646-4654 (1996)). 

Construction of GFP Mutants 

Preparation of GFP Plasmids 

The wt-GFP may be cloned from its natural source, Aequorea victoria, as 
described (Prasher, D.C., et al., Gene 111:229-233 (1992)). More preferably, 
GFP cDNA to be mutated is contained within a plasmid construct or vector, 
preferably an expression vector, suitable for use in transfecting mammalian cells, 
such as pRAY-1 wherein the wt-GFP cDNA is under the control of the human 
cytomegalovirus (CMV) enhancer/promoter (Marshall, J., et al., Neuron 
14:211-215 (1995)). Most preferably, to provide for optimum expression of the mutant 
GFPs in mammalian cells, the humanized S65T mutant GFP cDNA (Evans, K., et 
al., FOCUS 18(2):40-43 (1996); Zolotukhin, S., et al., J. Virol. 70:4646-4654 
(1996)) under control of the CMV enhancer/promoter may be used, contained in 
plasmid pGreenLantern-1 (Figure 5), which is available commercially from Life 
Technologies, Inc. (Rockville, Maryland). 

The above-described plasmids may be used directly for preparation of 
mutant GFP cDNAs according to the present invention. Alternatively, a stop 
codon in the 5' multiple cloning site of pGreenLantern-1 may be shifted out of 
frame by oligonucleotide ligation methods to allow the mutant GFPs of the present 
invention to be used in the construction of fusions between GFP and other 
proteins, as described below. 

Mutations to GFP cDNA 

A variety of random or site-directed mutagenic techniques may be used to 
prepare the mutant GFPs of the present invention. Appropriate methods include 
chemical mutagenesis using, for example, sodium bisulfite or hydroxylamine 
(Myers, R.M., et al., Science 229:242-247 (1985); Sikorski, R.S., and Boeke, 
J.D., Meth. Enzymol. 194:302-318 (1991)), linker insertion mutagenesis (Heffron, 
F., et al., Proc. Natl. Acad. Sci. USA 75:6012-6016 (1978)), deletion mutagenesis 
(Lai, C.J., and Nathans, D., J. Mol. Biol. 89:179-193 (1974); McKnight, S.L., and 
Kingsbury, R., Science 217:316-324 (1982)), enzyme misincorporation 
mutagenesis (Shortle, D., et al., Proc. Natl. Acad. Sci. USA 79:1588-1592 
(1982)), oligonucleotide-directed mutagenesis (Hutchinson, C.A., et al., J. Biol. 
Chem. 253:6551-6560 (1978); Zoller, M.J., and Smith, M., Nucl. Acids Res. 
10:6487-6500 (1982); Taylor, J.W., et al., Nucl. Acids Res. 13:8765-8785 
(1985)), and cassette mutagenesis (Lo, K.-M., et al., Proc. Natl. Acad. Sci. USA 
81:2285-2289 (1984); Wells, J.A., et al., Gene 34:315-323 (1985)). To improve 
the fidelity and efficiency of mutagenesis, the use of the polymerase chain reaction 
(PCR) in accomplishing GFP mutagenesis by one or more of the foregoing 
methods is preferred (Higuchi, R., et al., Nucl. Acids Res. 16:7351-7367 (1988); 
Leung, D.W., et al., Technique 1:11-15 (1989); Clackson, T., and Winter, G., 
Nucl. Acids Res. 17:10163-10170 (1989)). 

Most preferably, mutations are made to GFP cDNA by uracil DNA 
glycosylase (UDG) mutagenesis using PCR amplification (Nisson, P., et al., PCR 
Meth. Appl. 1:120-123 (1991)). In this approach, the plasmid containing GFP 
cDNA, most preferably pGreenLantern-1 comprising humanized S65T GFP 
(Figure 5), is used as the PCR template, and a sense or antisense primer consisting 
essentially of an oligonucleotide containing at least one mismatched nucleotide 
(available commercially from Life Technologies, Inc.; Rockville, Maryland) is 
added to the reaction mixture. Amplification reaction mixtures most preferably 
contain 1X PCR buffer, about 10 micromolar each of deoxyATP, deoxyTTP, 
deoxyCTP and deoxyGTP, about 25 picomoles each of sense and antisense 
primers and about 10 nanograms of template. PCR is performed by techniques 
that are routine in the art, and after at least five PCR cycles, samples of the 
reaction mixture are treated with UDG, most preferably for 30 minutes at 37°C, 
as described (Nisson, P., et al., PCR Meth. Appl. 1:120-123 (1991)). 
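
The primer arrangement used for UDG mutagenesis may be checked with a few 
lines of code. The Python sketch below is offered as an illustration under 
stated assumptions, not as part of the disclosed method: it assumes that the 
sense and antisense primers are designed with mutually complementary, 
dU-containing 5' ends, and it uses the A4 sense primer (SEQ ID NO:10) and the 
antisense primer (SEQ ID NO:16) as they are listed in Example 1. The function 
names are hypothetical, and deoxyuridine (U) is treated as thymidine (T) for 
base pairing. 

```python
def reverse_complement(seq):
    """Reverse complement of a primer; dU is treated as T for pairing."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    dna = seq.upper().replace("U", "T")
    return "".join(pairs[base] for base in reversed(dna))


def shared_5prime_overlap(sense, antisense):
    """Length of the longest 5' stretch of `sense` that pairs with the
    5' end of `antisense` (0 if the 5' ends are not complementary)."""
    sense_dna = sense.upper().replace("U", "T")
    # Reverse-complementing the antisense primer puts its 5' end at the
    # 3' end of the resulting sense-strand string.
    anti_rc = reverse_complement(antisense)
    for n in range(min(len(sense_dna), len(anti_rc)), 0, -1):
        if sense_dna[:n] == anti_rc[-n:]:
            return n
    return 0


# Primer sequences as listed in Example 1 (hyphens removed).
SENSE_A4 = "CAACACUGGUCACUACAATGGCCTATGGCGTGCAGTGCT"   # SEQ ID NO:10
ANTISENSE = "AGUGACCAGUGUUGGCCAAGGCACAGGGAGCTT"        # SEQ ID NO:16

overlap = shared_5prime_overlap(SENSE_A4, ANTISENSE)
print(overlap, SENSE_A4[:overlap])
```

For this primer pair the shared region is the first 14 nucleotides of the 
sense primer, which is also where all of its dU residues lie. 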

The mutated GFP nucleic acid molecules preferably will comprise nucleic 
acid sequences encoding mutant proteins in which one or more amino acid 
residues have been mutated from the wildtype amino acid sequence set forth in 
Figure 1 and SEQ ID NO:2. Such mutations may include, for example, 
substitutions, deletions, insertions or modifications, and preferably are amino acid 
substitutions. Particularly preferred are amino acid substitutions occurring in the 
three amino acid chromophore of GFP at residues 64, 65 and 66 of the wildtype 
GFP sequence (Figure 1 and SEQ ID NO:2), wherein the phenylalanine residue 
at position 64 (Phe64), the serine residue at position 65 (Ser65), and the tyrosine 
residue at position 66 (Tyr66), are each individually, or all together, replaced by 
other amino acid residues. More preferred mutant GFPs of the invention include, 
but are not limited to, those with the following substitutions from the wildtype 
GFP sequence shown in Figure 1 and SEQ ID NO:2: 

• serine 65 replaced by threonine (Ser65→Thr); 
• Phe64→Cys and Ser65→Ala (SEQ ID NO:5); 
• Phe64→Cys and Ser65→Thr; 
• Phe64→Leu and Ser65→Thr; 
• Phe64→Met and Ser65→Ala (SEQ ID NO:6); 
• Phe64→Met and Ser65→Thr; 
• Phe64→Met, Ser65→Phe and Tyr66→Phe; 
• Phe64→Met, Ser65→Phe and Tyr66→Lys; 
• Phe64→Thr and Ser65→Cys; and 
• Phe64→Val and Ser65→Cys. 

Other suitable mutations and mutant GFP amino acid sequences may be 
determined by one of ordinary skill without undue experimentation according to 
the methods described herein and others that are known in the art. As a practical 
matter, whether a particular mutation or combination of mutations produces a 
mutant GFP that may have the above-described desirable properties (e.g., higher 
expression in mammalian cells, higher fluorescence intensity under UV or white 
light illumination) may be determined by one of ordinary skill using the mutation, 
transfection, expression and detection methods described in detail below in the 
Examples, as well as using standard techniques that are routine in the art. 

Following mutagenesis by any of the above-described methods, the 
resulting nucleic acid molecules encoding the mutant GFPs may be inserted into 
one or more vectors, such as those described above, which are preferably 
expression vectors. A particularly preferred vector for containing the present 
mutant GFP nucleic acid molecules is pGreenLantern-2 (Figure 6). Methods for 
producing the mutant GFP-vector constructs will be familiar to those of ordinary 
skill, and are provided in detail below in Example 1. 

Once they have been constructed, the vectors comprising the mutant GFP 
nucleic acid molecules may be formulated into a variety of compositions, such as 
solutions (e.g., buffer solutions) to be used in transfecting host cells. 
Alternatively, the vector constructs may be purified and stored according to 
standard techniques for handling recombinant DNA plasmid vectors (Sambrook, 
J., et al., Molecular Cloning, a Laboratory Manual, 2nd Ed., Cold Spring 
Harbor, NY: Cold Spring Harbor Laboratory Press, pp. 1.3-1.20 (1989)). 

More preferably, the mutant GFP-containing plasmid vectors are 
transformed into a competent host cell. Any competent host cell may be used, 
including those of bacteria (e.g., E. coli), yeast (e.g., Saccharomyces spp.), insects 
(e.g., Spodoptera spp.) and mammals (e.g., CHO or BHK cells), although a 
competent strain of E. coli such as DH10B (Life Technologies, Inc.; Rockville, 
Maryland) is most preferably used. Transformation of mutagenized GFP cDNAs 
into host cells may be accomplished by any technique generally used for 
introduction of exogenous DNA, including the chemical, viral, electroporation, 
lipofection and microinjection methods that are well-known in the art. Particularly 
preferred methods for transformation include electroporation and 
liposome-mediated transfection (lipofection), the latter most preferably being accomplished 
using LIPOFECTAMINE™ (Life Technologies, Inc.; Rockville, Maryland). 

After expansion of transformed cultures, mutated GFP cDNA is isolated 
from the host cells by routine methods (Sambrook, J., et al., Molecular Cloning, 
a Laboratory Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor 
Laboratory Press, pp. 1.21-1.52 (1989)) and is subcloned into a plasmid backbone 
for use in subsequent transfections. Most preferably, this plasmid backbone is the 
pGreenLantern-2 backbone (see Figure 6), which contains a universal sequencing 
primer downstream from a CMV enhancer/promoter and an NsiI site immediately 
upstream of the CMV promoter allowing excision of the promoter region, along 
with XbaI, XhoI and HindIII sites in place of the 3' NotI site in pGreenLantern-1 
(Figure 5). 

Fusion sequences of GFP cDNA with nucleotide sequences encoding 
proteins of interest may be prepared by cloning the desired sequence(s) into 
pGreenLantern-2 at the 5' multiple cloning site using standard techniques. These 
fusion constructs allow the use of the mutant GFPs of the present invention as 
reporters of transfection efficiency. In addition, fusion constructs such as these 
will allow a direct examination of the expression, biochemistry and localization of 
the fused proteins intracellularly. 

Alternatively, to examine the structure and function of regulatory 
sequences (e.g., promoters, enhancers, inhibitors) in native genes, the GFP mutant 
cDNAs may be directly transfected or inserted, using routine methods, into target 
genomic or extrachromosomal DNA sequences in host cells (Chalfie, M., et al., 
Science 263:802-805 (1994)). 

Transfection of Hosts With GFP Mutants 

Target cells to be transfected with cDNAs comprising mutant GFPs (either 
fused or unfused to accessory sequences) are grown and maintained in culture 
according to routine methods. Cells may be transfected with mutant GFP cDNA 
by any method described above, although electroporation or liposome-mediated 
transfection (particularly using LIPOFECTAMINE™) are preferred. Following 
transfection, cells are incubated for 12-48 hours, preferably 18-24 hours and most 
preferably for about 24 hours. Transfected cells may then be examined for the 
expression of mutant GFP, manifested as green intracellular fluorescence. With 
standard optical filters routinely used for examining fluorescein (typically 
excitation wavelength of about 475 nm, dichroic filter of 485 nm, emission 
wavelength of about 490 nm), this fluorescence may be examined qualitatively, for 
example by fluorescence microscopy, or quantitatively, for example by 
spectrofluorimetry or flow cytofluorimetry. In addition, transfected cells 
expressing relatively high amounts of mutant GFPs of the present invention may 
be separated from non-transfected cells, or from those expressing lower levels of 
GFP, by fluorescence-based single cell separation techniques such as fluorescence- 
activated cell sorting. Alternatively, transfected cells expressing mutant GFPs that 
fluoresce under white light illumination, particularly those having amino acid 
sequences as set forth in SEQ ID NOs: 5 and 6, may be examined by the 
above-described qualitative and quantitative methods using standard white light optics 
(e.g., incandescent or halogen lighting, or sunlight). 

These transfected host cells may also be used in methods for the 
production of mutant GFPs of the invention. Such methods may comprise, for 
example, culturing the above-described host cells under conditions favoring the 
production of the mutant GFPs by the host cells, and isolating the mutant GFPs 
from the host cells and/or the culture medium in which the host cells are cultured. 
Typical host cell culture conditions favoring production of recombinant proteins, 
such as the present mutant GFPs, are well-known in the art (see, e.g., Sambrook, 
J., et al., Molecular Cloning, a Laboratory Manual, 2nd Ed., Cold Spring 
Harbor, NY: Cold Spring Harbor Laboratory Press (1989)). The mutant GFPs 
produced by these methods may then be isolated by any of a number of protein 
purification techniques, such as chromatography (preferably affinity 
chromatography, HPLC or FPLC), salt extraction (such as ammonium sulfate 
precipitation), electrophoresis, dialysis, or a combination thereof, to produce 
isolated mutant GFPs of the invention. These mutant GFPs may then be stored 
until use (preferably at temperatures below 0°C, more preferably at about -20 °C 
to about -70°C), or they may be formulated into compositions. Preferred such 
compositions may comprise, for example, one or more of the mutant GFPs of the 
invention and one or more additional components, such as one or more buffer 
salts, one or more inorganic salts or ions thereof, one or more detergents, one or 
more preservatives, and the like, preferably in an aqueous or organic solvent. 

Detection Methods 

In additional embodiments, the invention relates to methods of detecting 
the presence of a mutant GFP, or of a cell (such as a prokaryotic or eukaryotic, 
including mammalian, cell) expressing a mutant GFP. Such methods of the 
invention may comprise, for example, illuminating the mutant GFP or cell 
expressing the mutant GFP with a source of white light under conditions such that 
the mutant GFP or cell expressing the mutant GFP emits visible fluorescent light. 
In the present methods, the illumination source may be any light source emitting 
white (i.e., visible) light, including but not limited to an incandescent light source, 
a fluorescent light source, a halogen light source, sunlight, and the like. When 
illuminated by such a white light source, mutant GFPs, such as those of the 
present invention, will emit fluorescent light of various visible wavelengths 
(depending upon the specific mutations contained in the mutant GFP, as described 
above), which may be detected by eye or by any of the above-described qualitative 
or quantitative mechanical means. 

Kits 

In other preferred embodiments, the compositions of the present invention 
may be assembled into kits for use in transfecting host cells with the nucleic acid 
molecules encoding the present mutant GFPs, or for labeling target polypeptides 
with the present mutant GFPs. Host cell transfection kits according to the present 
invention may comprise at least one container containing one or more of the 
above-described nucleic acid molecules encoding a mutant GFP (or a composition 
comprising one or more of the nucleic acid molecules or plasmids described 
above), which nucleic acid molecule preferably comprises plasmid pGreenLantern- 
2/A1 or plasmid pGreenLantern-2/A4 (see Example 1 below). These transfection 
kits of the invention may optionally further comprise at least one additional 
container which may contain, for example, a reagent for delivering the mutant 
GFP nucleic acid molecule into a host cell; in preferred kits, this reagent may 
comprise a liposome and most preferably LIPOFECTAMINE™. Polypeptide 
labeling kits according to the present invention may comprise at least one 
container containing, for example, a mutant GFP such as those described above 
(or a composition of the invention comprising a mutant GFP), which is preferably 
a mutant GFP having an amino acid sequence as set forth in SEQ ID NO:5 or 
SEQ ID NO:6. These labeling kits of the invention may optionally further 
comprise at least one additional container which may contain, for example, a 
reagent for covalently linking the mutant GFP to the target polypeptide. 

Use of Mutant GFPs 

The mutant GFPs and kits of the present invention may be used in a variety 
of applications. For example, the mutant GFP cDNAs are useful as reporter genes 
that allow a determination of transfection efficiency and success (Chalfie, M., et 
al., Science 263:802-805 (1994)). Alternatively, the mutant proteins themselves 
may be used as fluorescent labels suitable for detectably labeling other proteins, 
nucleic acids or particulates to be used in a variety of applications (Heim, R., et 
al., Proc. Natl. Acad. Sci. USA 91:12501-12504 (1994); Yokoe, H., and Meyer, 
T., Nature Biotech. 14:1252-1256 (1996)), such as labeling antibodies used in 
infectious disease diagnostic methods; mutant GFPs may be attached to target 
polypeptides and proteins by a variety of methods that are well-known to one of 
ordinary skill in the art, including the use of chemical coupling reagents. In 
addition, fusion complexes between GFP and other proteins may be constructed 
to allow closer and more sensitive determinations of the expression, biochemistry, 
localization and trafficking of intracellular proteins in many host cells (Heim, R., 
et al., Proc. Natl. Acad. Sci. USA 91:12501-12504 (1994); Wang, S., and Tulle, 
H., Nature 369:400-403 (1994); Marshall, J., et al., Neuron 14:211-215 (1995); 
Rizzuto, R., et al., Curr. Biol. 6:183-188 (1996)). Importantly, use of the mutant 
GFPs that emit fluorescence when illuminated by white light will spare the user 
considerable expense and technical difficulty that can accompany the use of 
fluorescent optics for the examination of fluorescent reporter genes such as GFP. 

It will be readily apparent to one of ordinary skill in the relevant arts that 
other suitable modifications and adaptations to the methods and applications 
described herein are obvious and may be made without departing from the scope 
of the invention or any embodiment thereof. Having now described the present 
invention in detail, the same will be more clearly understood by reference to the 
following examples, which are included herewith for purposes of illustration only 
and are not intended to be limiting of the invention. 

Examples 

Example 1: Construction of Mutant GFP cDNAs 

Plasmids. As depicted in Figure 5, pGreenLantern-1 (Life Technologies, 
Inc., Rockville, Maryland; catalogue no. 10642) contains the humanized S65T 
mutant GFP cDNA (Figure 2; SEQ ID NOs:3, 4) (Evans, K., et al., FOCUS 
18(2):40-43 (1996); Zolotukhin, S., et al., J. Virol. 70:4646-4654 (1996)). This 
plasmid serves as the source of the GFP DNA sequence used for mutagenesis. As 
depicted in Figure 6, pGreenLantern-2 contains a universal sequencing primer 
downstream of the CMV promoter along with an NsiI site immediately upstream 
of the CMV promoter allowing excision of the promoter region. It also contains 
XbaI, XhoI and HindIII sites in place of the 3' NotI site in pGreenLantern-1. A 
stop codon in the 5' multiple cloning site of pGreenLantern-1 was shifted out of 
frame to allow possible fusions to GFP in pGreenLantern-2. 

Mutations to GFP cDNA by UDG cloning. PCR was performed in an MJ 
Research DNA Engine™ thermal cycler using the following conditions: 94 °C for 
60 seconds, 94°C for 30 seconds, 55°C for 30 seconds and 72°C for 4 minutes, 
repeated for 20 cycles. Sense oligonucleotide primers containing specific 
mismatches to the wt-GFP sequence (SEQ ID NOs:7-15; Table 2) were obtained 
from Life Technologies, Inc. (Rockville, Maryland). 

Table 2. Sense Oligonucleotides Used for UDG Cloning Mutations. 

Vector              Amino Acid           Single-stranded Oligonucleotide          SEQ ID
                    Mutations            Sequence (5' to 3')                      NO:
------------------  -------------------  ---------------------------------------  ------
pGreenLantern-2/A1  Cys64, Ala65         CAACACUGGUCACUACCTGCGCCTATGGCGTGC        7
pGreenLantern-2/A2  Cys64, Thr65         CCAACACUGGUCACUACCTGCACCTATGG            8
pGreenLantern-2/A3  Leu64, Thr65         CAACACUGGUCACUACCCTCACCTATGGCGTGCAGT     9
pGreenLantern-2/A4  Met64, Ala65         CAACACUGGUCACUACAATGGCCTATGGCGTGCAGTGCT  10
pGreenLantern-2/A5  Met64, Thr65         CAACACUGGUCACUACCATGACCTATGGCGTGCAGTGCT  11
pGreenLantern-2/A6  Met64, Phe65, Phe66  CAACACUGGUCACUACCATGTTCTTCGGCGTGCAGTGCT  12
pGreenLantern-2/A7  Met64, Phe65, Lys66  CAACACUGGUCACUACCATGTTCAAGGGCGTGCAGTGCT  13
pGreenLantern-2/A8  Thr64, Cys65         CAACACUGGUCACUACCACATGCTATGGCGTGCAGT     14
pGreenLantern-2/A9  Val64, Cys65         CAACACUGGUCACUACCGTGTGCTATGGCGTGCAGT     15



The antisense oligonucleotide primer used for each mutation set had the 
following sequence: 5'-AGU-GAC-CAG-UGU-UGG-CCA-AGG-CAC-AGG-GAG-CTT-3' 
(SEQ ID NO:16). The template plasmid used was pGreenLantern-1 
(Figure 5) with a universal reverse sequencing primer incorporated into the 
backbone. Amplification reactions contained 1X PCR buffer, 10 micromolar 
deoxynucleoside triphosphates, 25 picomoles of each primer (sense and antisense) 
and 10 nanograms of template DNA in a 50 microliter volume. After 6, 9 and 20 
PCR cycles were completed, 10 microliter samples were taken and checked via 
agarose gel electrophoresis for excess background. Two 20 microliter samples of 
each 6-cycle aliquot were digested with DpnI at 37°C for 30 minutes, then at 
75°C for 15 minutes and allowed to cool to room temperature. One of the 
samples from each reaction (four samples in all) was treated with one unit of uracil 
DNA glycosylase (UDG) at 37°C for 30 minutes (Nisson, P., et al., PCR Meth. 
Appl. 1:120-123 (1991)). PCR samples were then transformed into 100 
microliters of MAX Efficiency DH10B™ Competent Cells (Life Technologies, 
Inc.; Rockville, Maryland). The mutated portion of the GFP cDNA was then 
subcloned with a NotI and BamHI digest into the pGreenLantern-2 backbone 
(Figure 6) which was not subjected to PCR (Sambrook, J., et al., Molecular 
Cloning, a Laboratory Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring 
Harbor Laboratory Press (1989)). This approach yielded nine separate mutant 
GFP plasmid vectors, designated pGreenLantern-2/A1 through 
pGreenLantern-2/A9 (Table 2), each with a specific mutation or set of mutations within the GFP 
chromophore region at amino acids 64-66. 
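
The correspondence between each sense primer in Table 2 and the amino acid 
changes it encodes can be confirmed by translating the three codons that follow 
the unchanged upstream sequence. The Python sketch below is illustrative only. 
It uses the standard genetic code and the primer sequences as listed in 
Table 2; the anchor sequence used to find the reading frame (GGTCACT, taken 
here to end at codon 62, so that codon 63 is skipped and the next three codons 
are residues 64-66) is an assumption inferred from the primers themselves, and 
the function name is hypothetical. 

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Sense primers as listed in Table 2 (SEQ ID NOs:7-15).
SENSE_PRIMERS = {
    "A1": "CAACACUGGUCACUACCTGCGCCTATGGCGTGC",
    "A2": "CCAACACUGGUCACUACCTGCACCTATGG",
    "A3": "CAACACUGGUCACUACCCTCACCTATGGCGTGCAGT",
    "A4": "CAACACUGGUCACUACAATGGCCTATGGCGTGCAGTGCT",
    "A5": "CAACACUGGUCACUACCATGACCTATGGCGTGCAGTGCT",
    "A6": "CAACACUGGUCACUACCATGTTCTTCGGCGTGCAGTGCT",
    "A7": "CAACACUGGUCACUACCATGTTCAAGGGCGTGCAGTGCT",
    "A8": "CAACACUGGUCACUACCACATGCTATGGCGTGCAGT",
    "A9": "CAACACUGGUCACUACCGTGTGCTATGGCGTGCAGT",
}

ANCHOR = "GGTCACT"  # assumed to end at the last base of codon 62


def chromophore_residues(primer):
    """One-letter residues encoded at positions 64, 65 and 66."""
    dna = primer.upper().replace("U", "T")      # dU pairs like T
    start = dna.index(ANCHOR) + len(ANCHOR) + 3  # skip codon 63
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(start, start + 9, 3))


for name, primer in SENSE_PRIMERS.items():
    print(name, chromophore_residues(primer))
```

Under these assumptions the A1 primer translates to Cys-Ala-Tyr and the A4 
primer to Met-Ala-Tyr at positions 64-66, in agreement with the mutations 
listed for those vectors in Table 2. 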

Example 2: Growth and Transfection of Host Cells With Mutant GFPs 

Cell Culture. Chinese hamster ovary cells (CHO-K1, obtained from 
American Type Culture Collection (ATCC), Rockville, Maryland) were cultured 
in D-MEM (4,500 milligrams/liter D-glucose with L-glutamine and phenol red) 
plus 10% fetal bovine serum (FBS), 0.1 millimolar nonessential amino acids, 2.5 
units per milliliter penicillin and 2.5 micrograms per milliliter streptomycin 
(Freshney, R.I., Culture of Animal Cells: A Manual of Basic Techniques, 3rd Ed., 
New York: Wiley-Liss (1994)). Cells were grown at 37°C in a 5% CO2/air 
incubator. All media and reagents were from Life Technologies, Inc., Rockville, 
Maryland. 

Transfection. CHO-K1 cells were plated at 2 x 10^5 cells per well into 
six-well (35 millimeter diameter) plates one day prior to transfection. Immediately 
before transfection, cells were rinsed with medium containing no serum or 
antibiotics. LIPOFECTAMINE™ reagent was diluted into 100 microliters of 
OPTI-MEM-I Reduced Serum Medium (without FBS) to give a final 
concentration of LIPOFECTAMINE of 6 microliters per well. DNA was diluted 
separately to a concentration of 1 microgram per well in 100 microliters of 
OPTI-MEM-I. Transfection complexes were formed by combining diluted lipid and 
DNA and incubating for 30 minutes prior to addition to cells. Transfection 
complexes were then diluted 1:5 with D-MEM containing no FBS or antibiotics 
and added to the rinsed cells. Cells were transfected for five hours at 37°C, then 
fed with an equal volume of D-MEM containing 20% FBS, 0.1 millimolar 
nonessential amino acids, and no antibiotics. Cells were grown overnight at 37°C, 
5% CO2/air. In some studies, cells were grown for 48 hours; in these studies, 
transfection complexes were removed from cells 24 hours after addition and cells 
were fed with 2 milliliters per well of complete medium. 
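
The per-well quantities given above scale linearly with the number of wells, 
and the feeding step can be checked arithmetically. The following Python sketch 
is a convenience calculation only and rests on two assumptions that the text 
does not state: the small reagent volumes are neglected, so that the combined 
complexes are taken as 200 microliters per well, and "diluted 1:5" is read as 
one part complexes in five parts total. The function name and dictionary keys 
are hypothetical. 

```python
def transfection_plan(n_wells):
    """Scale the per-well amounts from Example 2 to n_wells wells.

    Assumptions (not stated in the text): reagent volumes are neglected
    when summing to 200 uL of complexes per well, and "1:5" means one
    part complexes in five parts total.
    """
    lipid_ul = 6.0 * n_wells                 # LIPOFECTAMINE reagent
    dna_ug = 1.0 * n_wells                   # plasmid DNA
    optimem_per_tube_ul = 100.0 * n_wells    # one tube for lipid, one for DNA
    complexes_ul = 2 * optimem_per_tube_ul   # lipid tube + DNA tube combined
    diluted_ul = 5 * complexes_ul            # after 1:5 dilution
    serum_free_dmem_ul = diluted_ul - complexes_ul
    feed_ul = diluted_ul                     # equal volume of 20% FBS medium
    final_fbs_percent = 20.0 * feed_ul / (diluted_ul + feed_ul)
    return {
        "lipid_ul": lipid_ul,
        "dna_ug": dna_ug,
        "optimem_per_tube_ul": optimem_per_tube_ul,
        "complexes_ul": complexes_ul,
        "serum_free_dmem_ul": serum_free_dmem_ul,
        "feed_ul": feed_ul,
        "final_fbs_percent": final_fbs_percent,
    }


plan = transfection_plan(6)  # one six-well plate
for key, value in plan.items():
    print(key, value)
```

Feeding with an equal volume of medium containing 20% FBS brings the serum to 
10%, the same concentration as the growth medium used for routine culture. 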

Regardless of the vector used, host cells transfected with the mutant GFP 
genes demonstrated approximately equivalent growth rates as control cells 
transfected with the wildtype GFP gene or with other reporter genes (e.g., β-gal). 
These results indicate that transfection with the mutant GFP cDNAs of the present 
invention does not adversely affect the growth or culturability of the host cells 
more than transfection with any other reporter vector. 

Example 3: Characterization of GFP Mutants Expressed in Eukaryotic Cells 

Formalin Fixation. Transfected host cells were rinsed in Dulbecco's 
Phosphate Buffered Saline (PBS), then fixed in a solution of 10% formalin in PBS 
for one hour. Formalin was then removed, and cells were rinsed and stored in 
PBS at 4 °C until being analyzed. 

Fluorescence Microscopy. Formalin-fixed cells were examined and 
photographed using an inverted phase contrast fluorescence microscope equipped 
with FITC filters (excitation 475 nm/dichroic 485 nm/barrier 490 nm) and a 50 
watt mercury arc bulb at 1.25 volts. A 40X-power adjustable non-phase objective 
was used for all micrographs, which were taken through blue, neutral and FITC 
filters using Kodak Ektachrome ASA 400 Daylight (for slides) or Kodak Gold 
ASA 400 Daylight (for prints). All exposures were for 12 seconds to allow 
unbiased comparison of fluorescence intensity. 

Flow Cytofluorimetry. Flow cytofluorimetry was performed on 
transfected CHO-K1 cells that were trypsinized and suspended in PBS plus 10% 
formalin at a concentration of less than 10^6 cells per milliliter. Measurements were 
made on a Coulter EPICS® XL-MCL flow cytometer using a 15 milliwatt argon 
ion laser. Filters used were 488 nm excitation, 500 nm dichroic LP/525 nm band 
pass for FL1 (green channel) and 575 nm band pass/600 nm dichroic LP for FL2 
(orange channel). Samples consisted of 20,000 events using PMT voltages of 100 
volts for side scatter and forward scatter, 496 volts for FL1 and 505 volts for FL2, 
all with integral gain set to 1.0. Color compensation included 7.9% orange signal 
in FL1 and 3.2% green signal in FL2. 
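
The effect of the color compensation settings can be illustrated with a simple 
two-channel calculation. The Python sketch below assumes the conventional 
pairwise subtractive form of compensation, in which 7.9% of the orange (FL2) 
signal is subtracted from FL1 and 3.2% of the green (FL1) signal is subtracted 
from FL2; the text gives only the percentages, so this reading, the function 
name and the example intensities are assumptions made for illustration. 

```python
ORANGE_IN_FL1 = 0.079  # fraction of the FL2 signal subtracted from FL1
GREEN_IN_FL2 = 0.032   # fraction of the FL1 signal subtracted from FL2


def compensate(fl1, fl2):
    """Pairwise subtractive compensation of one event (assumed form).

    fl1, fl2 are uncompensated linear intensities in arbitrary units.
    Negative results are clipped to zero.
    """
    fl1_comp = fl1 - ORANGE_IN_FL1 * fl2
    fl2_comp = fl2 - GREEN_IN_FL2 * fl1
    return max(fl1_comp, 0.0), max(fl2_comp, 0.0)


# A hypothetical GFP-positive event: strong green, weak orange spillover.
green, orange = compensate(1000.0, 40.0)
print(round(green, 2), round(orange, 2))
```

With these settings a bright green event loses little of its FL1 signal, while 
most of its apparent orange signal is attributed to spillover and removed. 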

Results. As shown in Table 3, the GFP mutants of the present invention 
displayed varying intensities and kinetics of formation in transfected cells. Two 
of these mutants, designated "A1" (phenylalanine mutated to cysteine at position 
64; serine mutated to alanine at position 65; Figure 3; SEQ ID NO:5) and "A4" 
(phenylalanine mutated to methionine at position 64; serine mutated to alanine at 
position 65; Figure 4; SEQ ID NO:6) were exceptionally bright. As shown in 
Figures 7-9, CHO cells transfected with plasmid pGreenLantern-2/A1 (Figure 7) 
or with plasmid pGreenLantern-2/A4 (Figure 8) demonstrated a dramatic increase 
in green fluorescence intensity over cells transfected with the humanized S65T 
mutation of pGreenLantern-1 (Figure 9) when viewed at 24 hours 
post-transfection using FITC optics. 

Table 3. Effects of Point Mutations on GFP Fluorescence Intensity. 

Vector              Amino Acids          Fluorescence Results
------------------  -------------------  ------------------------------------------------
Wildtype GFP        Phe64, Ser65         λex = 395 nm (major), 470 nm (minor); 48 hours
                                         required for detection
S65T                Phe64, Thr65         6-fold increase in intensity over wildtype
pGreenLantern-1     Phe64, Thr65         22-fold increase in intensity over wildtype
                    (humanized)
pGreenLantern-2/A1  Cys64, Ala65         6-fold increase in intensity over S65T
pGreenLantern-2/A2  Cys64, Thr65         22-fold increase in intensity over wildtype
pGreenLantern-2/A3  Leu64, Thr65         6-fold increase in intensity over S65T
pGreenLantern-2/A4  Met64, Ala65         6-fold increase in intensity over S65T
pGreenLantern-2/A5  Met64, Thr65         Slight increase in intensity over pGreenLantern-1
pGreenLantern-2/A6  Met64, Phe65, Phe66  Equivalent to wildtype
pGreenLantern-2/A7  Met64, Phe65, Lys66  Equivalent to wildtype
pGreenLantern-2/A8  Thr64, Cys65         Equivalent to wildtype
pGreenLantern-2/A9  Val64, Cys65         Slight increase in intensity over pGreenLantern-1

Other mutants produced in the present studies were less satisfactory 
(Table 3). For example, mutants A5 (phenylalanine mutated to methionine at 
position 64; serine mutated to threonine at position 65) and A9 (phenylalanine 
mutated to valine at position 64; serine mutated to cysteine at position 65) gave 
only slightly better fluorescence than the humanized S65T mutation of 
pGreenLantern-1. It is possible that the highly reactive cysteine at position 65 in 
mutant A9 may interfere with the formation of the three amino acid heterocyclic 
ring required for GFP fluorescence (Cody, C.W., et al., Biochemistry 32:1212-1218 
(1993)). 

Mutant A2 (phenylalanine mutated to cysteine at position 64; serine 
mutated to threonine at position 65) was equal in fluorescence to the humanized 
S65T pGreenLantern-1 (Evans, K., et al., FOCUS 18(2):40-43 (1996); 
Zolotukhin, S., et al., J. Virol. 70:4646-4654 (1996)), while mutants A6 
(phenylalanine mutated to methionine at position 64; serine mutated to 
phenylalanine at position 65; tyrosine mutated to phenylalanine at position 66), A7 
(phenylalanine mutated to methionine at position 64; serine mutated to 
phenylalanine at position 65; tyrosine mutated to lysine at position 66) and A8 
(phenylalanine mutated to threonine at position 64; serine mutated to cysteine at 
position 65) demonstrated a decreased fluorescence intensity and were, in fact, 
equivalent to wt-GFP. No shift in excitation or emission spectra was detected 
with these three mutants, however, as no fluorescence was observed using 
ultraviolet or rhodamine filter combinations. 

These results were also observed via flow cytometry. As shown in Figure 
10, CHO-K1 cells transfected with the A1 and A4 mutant GFPs demonstrated a 
dramatic increase in fluorescence over wildtype and A6-A8 mutants within 24 
hours of transfection. This high level of fluorescence was maintained, particularly 
for cells transfected with the A4 mutant GFP, for at least 48 hours after 
transfection (Figure 11). 

Mutations at certain amino acid positions outside the chromophore were 
also examined for their effects on GFP fluorescence. Mutation of Gln69→Asn in 
the A4 mutant resulted in a dramatic decrease in fluorescence relative to the A4 
mutant itself, as did mutation of Val163→Ala and Ile167→Thr in the A4 mutant. 

Together, these results indicate that the most preferable mutations for 
providing highly fluorescent, rapidly expressed GFPs are those in which only one 
reactive amino acid is present at either position 64 or 65, as in the A1 
(Phe64→Cys; Ser65→Ala; SEQ ID NO:5) and A4 (Phe64→Met; Ser65→Ala; 
SEQ ID NO:6) mutants. 
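
The substitutions defining the A1 and A4 mutants can be illustrated on the protein sequence itself. The sketch below is a minimal example under stated assumptions: `GFP_1_80` is a one-letter transcription of residues 1-80 of SEQ ID NO:4 as printed in the Sequence Listing (Phe at position 64, Ser at position 65), and the helper `apply_mutations` and its "F64C"-style notation are introduced here for illustration only.

```python
# Residues 1-80, transcribed in one-letter code from SEQ ID NO:4 as printed
# in the Sequence Listing (an assumption of this sketch).
GFP_1_80 = (
    "MSKGEELFTGVVPILV"
    "ELDGDVNGHKFSVSGE"
    "GEGDATYGKLTLKFIC"
    "TTGKLPVPWPTLVTTF"
    "SYGVQCFSRYPDHMKQ"
)


def apply_mutations(seq, mutations):
    """Apply substitutions written like 'F64C' (1-based position) to a protein."""
    residues = list(seq)
    for mutation in mutations:
        old, pos, new = mutation[0], int(mutation[1:-1]), mutation[-1]
        # Refuse to mutate if the starting residue is not the one expected.
        if residues[pos - 1] != old:
            raise ValueError(f"expected {old} at position {pos}, found {residues[pos - 1]}")
        residues[pos - 1] = new
    return "".join(residues)


A1 = apply_mutations(GFP_1_80, ["F64C", "S65A"])  # Phe64->Cys; Ser65->Ala
A4 = apply_mutations(GFP_1_80, ["F64M", "S65A"])  # Phe64->Met; Ser65->Ala

# Residues 61-67 before and after mutation.
print(GFP_1_80[60:67])  # VTTFSYG
print(A1[60:67])        # VTTCAYG
print(A4[60:67])        # VTTMAYG
```

Checking the starting residue before each substitution guards against off-by-one errors in position numbering.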

Example 4: Characterization of GFP Mutants Expressed in Prokaryotic Cells 

To examine the efficacy of expressing mutant GFPs in prokaryotic cells, 
mutant GFP cDNAs were subcloned into the bacterial pProEX HTb vector 
(Figure 12). GFP cDNA was excised by NotI and XbaI digestion from 
pGreenLantern-2 (Figure 6) containing the mutations at positions 64, 65 and/or 
66 (mutants A1 through A9) shown in Table 3. The bacterial vector pProEX HTb 
(Figure 12) was also digested with the same enzymes. The pProEX HTb 
backbone and GFP fragments were ligated to form the corresponding transfection 
vectors containing the respective mutant GFP fragments: pProEXA1, pProEXA2, 
pProEXA3, pProEXA4, pProEXA5, pProEXA6, pProEXA7, pProEXA8 and 
pProEXA9. These vectors were then individually transformed into 100 µl of 
DH10B E. coli host cells; control cells were also prepared that had been 
transfected with a construct containing the S65T mutant described in Examples 
1-3 above. Cells were plated onto ampicillin/IPTG plates and incubated overnight 
at 37°C, and colonies were then picked and screened for fluorescence under long 
ultraviolet (UV) or blue illumination. 

Colonies containing the A1, A2, A3, A4, A5, A9 and S65T mutant GFPs 
all demonstrated green fluorescence when illuminated with long UV or blue light, 
while those containing the A6, A7 and A8 mutant GFPs demonstrated no 
fluorescence under these conditions. These results are consistent with those 
observed in eukaryotic cells, as shown in Example 3 above, and indicate that 
mutant GFPs may be successfully transfected into and expressed in prokaryotic 
cells. 

Example 5: Visible Light Excitation of GFP Mutants 

To examine the ability of mutant GFPs to emit fluorescence when 
illuminated by white light, E. coli cells were transfected and plated as described 
above in Example 4. Colonies were then picked and examined for fluorescence 
upon illumination by incandescent light, fluorescent indoor lighting, or sunlight. 

Upon induction of the host cells with IPTG, cells transformed with the 
vector comprising the A4 GFP mutation unexpectedly exhibited bright green light 
emission under normal daylight conditions, without the need for excitation with 
UV light. Similar results were observed for cells transformed with the A3 mutant 
GFP. Cells containing the A1 and A5 mutant GFPs were also seen to be less (but 
still observably) fluorescent under white light illumination. Conversely, only very 
weak emission of light was observed under white light illumination in the cells 
transformed with the vectors comprising only the S65T, A2 and A9 mutations. 
Cells comprising the A6, A7 and A8 mutations exhibited no fluorescence when 
illuminated by white light. 

When plates containing these mutants were stored in the dark at 4°C for 
38 days, however, all of the colonies except those containing the A6, A7 or A8 
mutant GFPs were seen to be more intensely fluorescent under white light 
illumination. Colonies containing the A3, A4 and A5 mutants were more 
fluorescent under these conditions than were those containing the A1, A2, A9 and 
S65T mutants, although all colonies fluoresced more brightly than they did in 
freshly plated cells (i.e., when observed within 24-48 hours of transfection). 
When these plates were allowed to warm to room temperature, the fluorescence 
in colonies containing the A1, A2, A9 and S65T mutants decreased, while that in 
colonies containing the A3, A4 and A5 mutants remained brightly fluorescent. 

It is possible that the increased fluorescence observed in stored plates may 
have been due to accumulation of mutant protein in the cells over time in storage, 
indicating a dependence of white light fluorescence upon intracellular 
concentration of the GFP. To test this notion, a 6His-tagged A4 GFP construct, 
prepared and isolated by metal affinity chromatography according to standard 
techniques (see Ausubel, F.M., et al., in Current Protocols in Molecular Biology, 
New York: John Wiley & Sons, Inc., pp. 10.11.10-10.11.24 (1996)), was 
examined for fluorescence under blue, red and white light at various protein 
concentrations in solution. At a concentration of about 1.5 µg/ml, the purified A4 
GFP was brightly fluorescent under sunlight and fluorescent indoor white lighting, 
as well as under blue light; no fluorescence was observed, however, under red 
light. This highly concentrated A4 GFP solution became nonfluorescent upon 
boiling, but was at least slightly fluorescent up to a temperature of about 82°C. 
When diluted to 0.1 ng/ml, however, the A4 GFP solution fluoresced brightly 
under blue light (closer in wavelength to the excitation maximum of GFP which 
is in the UV range), but did not fluoresce under white light illumination. These 
results suggest that the increased fluorescence observed upon white light 
illumination of colonies stored for extended periods of time may be due to 
accumulation of GFP protein in the cells. 

Taken together, these results indicate that prokaryotic cells containing the 
A3 or A4 mutant GFPs, and to a lesser extent the A1 and A5 mutant GFPs, can 
emit light without the addition of an exogenous substrate or the use of ultraviolet 
irradiation. Use of these GFP constructs thus provides advantages over other 
visible light reporter vectors which require the use of exogenous substrates, and 
over other fluorescent reporter vectors which require UV irradiation which may 
induce undesirable mutations in the host cells. 

Example 6: Additional GFP Mutations 

To examine the effects of alternative point mutations on GFP fluorescence, 
mutations are targeted at the tryptophan residue at position 57 (the only 
tryptophan residue in the entire GFP molecule, which is located in the unique motif 
Pro-Val-Pro-Trp-Pro (SEQ ID NO:17)). To accomplish this mutation, 
oligonucleotides are designed to mutate Trp57→His or Trp57→Tyr, in 
conjunction with the Ser65→Thr mutant (SEQ ID NO:4) or the Phe64→Met; 
Ser65→Ala mutant (SEQ ID NO:6). These mutants are made in the bacterial 
vector pProEX HTb as described in Example 4, using specific oligonucleotides 
designed to provide the desired mutations. The vector constructs are then 
transfected into host cells and characterized as above for their fluorescence. 

In a similar fashion, mutations are made at other amino acid positions 
outside of the GFP chromophore region. For example, mutations are made at 
Arg96, which is probably responsible for stabilizing resonance structures of the 
imidazolidone 5-membered ring during ring formation and possibly during 
excitation, and is therefore a target for more rapid ring formation and, hence, 
faster detection of fluorescence. Mutations involving this residue include 
Arg96→His. 

Mutations are also possible at Phe46, which along with Phe64 separates 
the 5-membered chromophore ring from direct contact with the single tryptophan 
in the Ser65→Thr GFP (SEQ ID NO:4). By allowing direct hydrogen bonding 
between Trp57 and the ring structure, efficient energy transfer is possible as with 
the Phe64→Leu; Ser65→Thr mutant. Mutations involving this residue include 
Phe46→Leu or other hydrophobic residues that promote hydrogen bonding. 

Mutations are also made at Leu221 and Phe223, which are involved in 
dimer formation. Only three hydrophobic residues are in the dimer contact region; 
all others are hydrophilic. By mutating Leu221 and/or Phe223 to a hydrophilic 
or "neutral" residue such as glycine, GFP aggregation, which can be a problem 
with GFP fusion constructs, may be inhibited. 

Mutations are also made at His148, which probably stabilizes the 
fluorophore and forms hydrogen bonds with Tyr66 and Gln94. Mutations of 
His148 to a residue with a different charge or a different pKa are made to allow 
alteration of the excitation and emission spectra of GFP, similar to results seen 
with Tyr66→His, which results in blue fluorescence by GFP. 

Finally, mutations introducing a second 5-membered ring structure into 
the α-helix of GFP are made, to allow increased fluorescence intensity of the 
resultant GFP. 
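
The residue positions named in this Example can be cross-checked against the Sequence Listing. The following sketch assumes a one-letter transcription of the 238-residue sequence of SEQ ID NO:4 as printed (Ser at position 65); the names `GFP`, `CITED` and `mismatches` are hypothetical and serve only this illustration.

```python
# One-letter transcription of SEQ ID NO:4 as printed in the Sequence Listing
# (an assumption of this sketch); rows follow the listing, 16 residues each.
GFP = (
    "MSKGEELFTGVVPILV" "ELDGDVNGHKFSVSGE" "GEGDATYGKLTLKFIC"
    "TTGKLPVPWPTLVTTF" "SYGVQCFSRYPDHMKQ" "HDFFKSAMPEGYVQER"
    "TIFFKDDGNYKTRAEV" "KFEGDTLVNRIELKGI" "DFKEDGNILGHKLEYN"
    "YNSHNVYIMADKQKNG" "IKVNFKIRHNIEDGSV" "QLADHYQQNTPIGDGP"
    "VLLPDNHYLSTQSALS" "KDPNEKRDHMVLLEFV" "TAAGITHGMDELYK"
)

# Residues cited in Example 6, keyed by 1-based position.
CITED = {46: "F", 57: "W", 64: "F", 66: "Y", 94: "Q",
         96: "R", 148: "H", 221: "L", 223: "F"}


def mismatches(seq, cited):
    """Return {position: (cited, found)} for every cited residue that disagrees."""
    return {pos: (aa, seq[pos - 1]) for pos, aa in cited.items() if seq[pos - 1] != aa}


print(len(GFP))                  # 238
print(mismatches(GFP, CITED))    # {}
print(GFP.find("PVPWP") + 1)     # 54: the motif spans residues 54-58
print(GFP.count("W"))            # 1: a single tryptophan, at position 57
```

An empty mismatch dictionary indicates that each cited position carries the stated residue in the transcribed sequence.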

Having now fully described the present invention in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious 
to one of ordinary skill in the art that the same can be performed by modifying or 
changing the invention within a wide and equivalent range of conditions, 
formulations and other parameters without affecting the scope of the invention or 
any specific embodiment thereof, and that such modifications or changes are 
intended to be encompassed within the scope of the appended claims. 

All publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same extent 
as if each individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference. 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Life Technologies, Inc. 

(B) STREET: 9800 Medical Center Drive 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 20850 

(ii) TITLE OF INVENTION: Mutants of Green Fluorescent Protein 

(iii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (To be assigned) 

(B) FILING DATE: 17-NOV-1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US (To be assigned) 

(B) FILING DATE: 14-NOV-1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/030,935 

(B) FILING DATE: 15-NOV-1996 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: gfp10 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..714 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

ATG AGC AAG GGC GAG GAA CTG TTC ACT GGC GTG GTC CCA ATT CTC GTG 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

GAA CTG GAT GGC GAT GTG AAT GGG CAC AAA TTT TCT GTC AGC GGA GAG 96 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

GGT GAA GGT GAT GCC ACA TAC GGA AAG CTC ACC CTG AAA TTC ATC TGC 144 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

ACC ACT GGA AAG CTC CCT GTG CCA TGG CCA ACA CTG GTC ACT ACC TTC 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

ACC TAT GGC GTG CAG TGC TTT TCC AGA TAC CCA GAC CAT ATG AAG CAG 240 
Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

CAT GAC TTT TTC AAG AGC GCC ATG CCC GAG GGC TAT GTG CAG GAG AGA 288 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

ACC ATC TTT TTC AAA GAT GAC GGG AAC TAC AAG ACC CGC GCT GAA GTC 336 
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

AAG TTC GAA GGT GAC ACC CTG GTG AAT AGA ATC GAG TTG AAG GGC ATT 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

GAC TTT AAG GAA GAT GGA AAC ATT CTC GGC CAC AAG CTG GAA TAC AAC 432 
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

TAT AAC TCC CAC AAT GTG TAC ATC ATG GCC GAC AAG GAA AAG AAT GGC 480 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

ATC AAG GTC AAC TTC AAG ATC AGA CAC AAC ATT GAG GAT GGA TCC GTG 528 
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

CAG CTG GCC GAC CAT TAT CAA CAG AAC ACT CCA ATC GGC GAC GGC CCT 576 
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

GTG CTC CTC CCA GAC AAC CAT TAC CTG TCC ACC CAG TCT GCC CTG TCT 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTG CTG GAG TTT GTG 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

ACC GCT GCT GGG ATC ACA CAT GGC ATG GAC GAG CTG TAC AAG 714 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

TGA 717 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 
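
The correspondence between the coding sequence of SEQ ID NO:1 and the residues listed beneath it can be spot-checked by translating individual rows. The sketch below is illustrative only: it builds the standard genetic code table from the conventional TCAG ordering, and the names `CODON_TABLE`, `translate`, `ROW_1` and `ROW_5` are assumptions introduced here. Only the first and fifth rows of SEQ ID NO:1 are transcribed.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (spaces allowed) codon by codon; '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Rows transcribed from SEQ ID NO:1 (codons 1-16 and codons 65-80).
ROW_1 = "ATG AGC AAG GGC GAG GAA CTG TTC ACT GGC GTG GTC CCA ATT CTC GTG"
ROW_5 = "ACC TAT GGC GTG CAG TGC TTT TCC AGA TAC CCA GAC CAT ATG AAG CAG"

print(translate(ROW_1))  # MSKGEELFTGVVPILV
print(translate(ROW_5))  # TYGVQCFSRYPDHMKQ
```

Both translated rows agree with the corresponding residues of SEQ ID NO:2, including Thr at position 65.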

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: gfp (h) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..714 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
240 245 250 

GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG 96 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
255 260 265 270 

GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC 144 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
275 280 285 

ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
290 295 300 

TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CAG 240 
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
305 310 315 

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA 288 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
320 325 330 

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC 336 
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
335 340 345 350 

AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
355 360 365 

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA TTG GAA TAC AAC 432 
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
370 375 380 

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA 480 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
385 390 395 

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT 528 
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
400 405 410 

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT 576 
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
415 420 425 430 

GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
435 440 445 

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
450 455 460 

ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA 714 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
465 470 475 

TAA 717 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



WO 98/21355 



PCT/US97/21662 




(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Cys 
50 55 60 

Ala Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Met 
50 55 60 

Ala Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAACACUGGU CACUACCTGC GCCTATGGCG TGC 33 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCAACACUGG UCACUACCTG CACCTATGG 29 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CAACACUGGU CACUACCCTC ACCTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAACACUGGU CACUACAATG GCCTATGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CAACACUGGU CACTACCATG ACCTATGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAACACUGGU CACUACCATG TTCTTCGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CAACACUGGU CACUACCATG TTCAAGGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAACACUGGU CACUACCACA TGCTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CAACACUGGU CACUACCGTG TGCTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGUGACCAGU GUUGGCCAAG GCACAGGGAG CTT 33 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Pro Val Pro Trp Pro 
1 5 
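
The mutations carried by the oligonucleotides of SEQ ID NOS:7-15 can be read off by translating the three codons that follow the shared upstream sequence. The sketch below is illustrative and rests on stated assumptions: uracil is read as thymine, the anchor `GTCACTAC` (codons 61-62 plus the first two bases of codon 63 in SEQ ID NO:1) is used to locate codon 64, and the helper name `codons_64_to_66` is hypothetical. The pairing of each oligonucleotide with a mutant of Table 3 is inferred from the decoded residues rather than taken from the listing itself.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

# Oligonucleotides transcribed from the Sequence Listing, keyed by SEQ ID NO.
PRIMERS = {
    7: "CAACACUGGUCACUACCTGCGCCTATGGCGTGC",
    8: "CCAACACUGGUCACUACCTGCACCTATGG",
    9: "CAACACUGGUCACUACCCTCACCTATGGCGTGCAGT",
    10: "CAACACUGGUCACUACAATGGCCTATGGCGTGCAGTGCT",
    11: "CAACACUGGUCACTACCATGACCTATGGCGTGCAGTGCT",
    12: "CAACACUGGUCACUACCATGTTCTTCGGCGTGCAGTGCT",
    13: "CAACACUGGUCACUACCATGTTCAAGGGCGTGCAGTGCT",
    14: "CAACACUGGUCACUACCACATGCTATGGCGTGCAGT",
    15: "CAACACUGGUCACUACCGTGTGCTATGGCGTGCAGT",
}


def codons_64_to_66(primer):
    """Return the residues encoded at positions 64, 65 and 66 of a primer."""
    dna = primer.replace("U", "T")
    # The anchor covers codon 61 (GTC), codon 62 (ACT) and two bases of
    # codon 63; codon 64 therefore starts nine bases after the anchor start.
    start = dna.index("GTCACTAC") + 9
    return "".join(CODON_TABLE[dna[start + i:start + i + 3]] for i in (0, 3, 6))


for seq_id, primer in PRIMERS.items():
    print(seq_id, codons_64_to_66(primer))
```

Under these assumptions the decoded residues for SEQ ID NOS:7-15 are CAY, CTY, LTY, MAY, MTY, MFF, MFK, TCY and VCY, which match the residues listed in Table 3 for mutants A1 through A9 in that order.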

WHAT IS CLAIMED IS: 

1. A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Fluorescent Protein having an amino acid sequence 
comprising an amino acid residue lacking an aromatic ring structure at position 64 
and an amino acid residue having a side chain no longer than two carbon units in 
length at position 65, with the provisos that 

if said residue at position 64 is leucine then said residue at position 65 is 
not cysteine or threonine; 

if said residue at position 64 is valine then said residue at position 65 is not 
alanine; 

if said residue at position 64 is methionine then said residue at position 65 
is not glycine; and 

if said residue at position 64 is glycine then said residue at position 65 is 
not cysteine. 

2. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 64 is selected from the group consisting of alanine, valine, 
leucine, isoleucine, proline, methionine, glycine, serine, threonine, cysteine, 
alanine, asparagine, glutamine, aspartic acid and glutamic acid. 

3. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 64 is cysteine or methionine. 

4. The nucleic acid molecule of claim 3, wherein said amino acid 
residue at position 65 is alanine. 

5. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 65 is selected from the group consisting of alanine, glycine, 
threonine, cysteine, asparagine and aspartic acid. 

6. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 65 is alanine. 

7. The nucleic acid molecule of claim 6, wherein said amino acid 
residue at position 64 is cysteine or methionine. 

8. A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Fluorescent Protein having an amino acid sequence as 
set forth in SEQ ID NO:5. 

9. A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Fluorescent Protein having an amino acid sequence as 
set forth in SEQ ID NO:6. 



10. A mutant Green Fluorescent Protein having an amino acid 
sequence comprising an amino acid residue lacking an aromatic ring structure at 
position 64 and an amino acid residue having a side chain no longer than two 
carbon atoms in length at position 65, with the provisos that 

(a) if said residue at position 64 is leucine then said residue at position 65 
is not cysteine or threonine; 

(b) if said residue at position 64 is valine then said residue at position 65 
is not alanine; 

(c) if said residue at position 64 is methionine then said residue at position 
65 is not glycine; and 

(d) if said residue at position 64 is glycine then said residue at position 65 
is not cysteine. 

11. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 64 is selected from the group consisting of alanine, 
valine, leucine, isoleucine, proline, methionine, glycine, serine, threonine, cysteine, 
alanine, asparagine, glutamine, aspartic acid and glutamic acid. 

12. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 64 is cysteine or methionine. 

13. The mutant Green Fluorescent Protein of claim 12, wherein said 
amino acid residue at position 65 is alanine. 

14. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 65 is selected from the group consisting of alanine, 
glycine, threonine, cysteine, asparagine and aspartic acid. 

15. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 65 is alanine. 

16. The mutant Green Fluorescent Protein of claim 15, wherein said 
amino acid residue at position 64 is cysteine or methionine. 

17. A mutant Green Fluorescent Protein having an amino acid 
sequence as set forth in SEQ ID NO:5. 

18. A mutant Green Fluorescent Protein having an amino acid 
sequence as set forth in SEQ ID NO:6. 

19. A host cell comprising the nucleic acid molecule of claim 1. 

20. A vector comprising the nucleic acid molecule of claim 1. 

21. The vector of claim 20, wherein said vector is an expression vector. 

22. A host cell comprising the vector of claim 20. 

23. A method for producing a mutant Green Fluorescent Protein, 
comprising culturing the host cell of claim 19 or claim 22 under conditions 
favoring the production of a mutant Green Fluorescent Protein, and isolating said 
mutant Green Fluorescent Protein from said host cell. 

24. A mutant Green Fluorescent Protein produced by the method of 
claim 23. 

25. The mutant Green Fluorescent Protein of any one of claims 10, 17, 
18 or 24, wherein said mutant Green Fluorescent Protein emits fluorescent light 
when illuminated by white light. 

26. A composition comprising plasmid pGreenLantern-2/A1.

27. A composition comprising plasmid pGreenLantern-2/A4. 

28. A composition comprising the mutant Green Fluorescent Protein 
of any one of claims 10, 17, 18 or 24. 

29. A composition comprising the mutant Green Fluorescent Protein 
of claim 25.

30. A kit for transfecting a host cell with a nucleic acid molecule
encoding a mutant Green Fluorescent Protein, said kit comprising at least one
container containing the nucleic acid molecule of claim 1.

31. The kit of claim 30, wherein said nucleic acid molecule comprises
plasmid pGreenLantern-2/A1 or plasmid pGreenLantern-2/A4.

32. The kit of claim 30, further comprising at least one additional 
container containing a reagent for delivering said nucleic acid molecule into a host 
cell. 

33. The kit of claim 32, wherein said reagent for delivering said nucleic
acid molecule into a host cell comprises a liposome. 

34. A kit for labeling a polypeptide with a mutant Green Fluorescent 
Protein, said kit comprising at least one container containing the mutant Green 
Fluorescent Protein of any one of claims 10, 17, 18 or 24. 

35. The kit of claim 34, wherein said mutant GFP fluoresces when 
illuminated by white light. 

36. The kit of claim 34, further comprising at least one additional 
container containing a reagent for covalently attaching said mutant Green 
Fluorescent Protein to a polypeptide. 

37. A method of detecting the presence of a mutant GFP comprising 
illuminating the mutant GFP with a source of white light under conditions such 
that the mutant GFP emits visible fluorescent light.

38. A method of detecting the presence of a cell expressing a mutant
GFP comprising illuminating the cell with a source of white light under conditions 
such that the mutant GFP expressed by the cell emits visible fluorescent light.

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1               5                   10                  15

GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30

GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45

ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60

TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CAG
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110

AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA TTG GAA TAC AAC
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
                165                 170                 175

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190

GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
        195                 200                 205

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220

ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA TAA
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys *
225                 230                 235

(SEQ ID NOs: 1, 2)

FIG. 1

SUBSTITUTE SHEET (RULE 26)

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT  48
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
    240                 245                 250

GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG  96
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
255                 260                 265                 270

GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC  144
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
                275                 280                 285

ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC  192
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
            290                 295                 300

TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CAG  240
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
        305                 310                 315

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA     GAA AGA  288
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
    320                 325                 330

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC  336
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
335                 340                 345                 350

AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT  384
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
                355                 360                 365

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA TTG GAA TAC AAC  432
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
            370                 375                 380

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA  480
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
        385                 390                 395

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT  528
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
    400                 405                 410

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT  576
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
415                 420                 425                 430

GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG  624
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
                435                 440                 445

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA  672
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
            450                 455                 460

ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA          714
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
        465                 470                 475

TAA                                                              717

(SEQ ID NOs: 3, 4)

FIG. 2

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1               5                   10                  15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Cys
    50                  55                  60

Ala Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
                165                 170                 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
        195                 200                 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225                 230                 235

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1               5                   10                  15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Met
    50                  55                  60

Ala Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
65                  70                  75                  80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
                165                 170                 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
            180                 185                 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
        195                 200                 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
    210                 215                 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225                 230                 235

(SEQ ID NO: 6)

FIG. 4
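
The amino acid sequences shown in the figures can be compared residue by residue around positions 64 and 65. The following is a minimal illustrative sketch in Python, not part of the disclosed methods: it transcribes residues 60 to 68 from FIG. 1, applies the residues found at positions 64 and 65 in the two mutant sequences shown in the figures, and lists the positions that differ. The names WILD_TYPE, FIRST_POSITION, substitute and differences are hypothetical and are introduced here only for illustration.

```python
# Residues 60-68 of the wild-type sequence, transcribed from FIG. 1.
WILD_TYPE = ["Leu", "Val", "Thr", "Thr", "Phe", "Ser", "Tyr", "Gly", "Val"]
FIRST_POSITION = 60  # sequence position of WILD_TYPE[0]


def substitute(segment, changes, first_position=FIRST_POSITION):
    """Return a copy of segment with {position: residue} substitutions applied."""
    mutant = list(segment)
    for position, residue in changes.items():
        index = position - first_position
        if not 0 <= index < len(mutant):
            raise ValueError(f"position {position} is outside the segment")
        mutant[index] = residue
    return mutant


def differences(reference, other, first_position=FIRST_POSITION):
    """List (position, reference residue, other residue) where the two differ."""
    return [
        (first_position + i, ref, alt)
        for i, (ref, alt) in enumerate(zip(reference, other))
        if ref != alt
    ]


# Residues at positions 64 and 65 as they appear in the two mutant sequences.
cys64_ala65 = substitute(WILD_TYPE, {64: "Cys", 65: "Ala"})
met64_ala65 = substitute(WILD_TYPE, {64: "Met", 65: "Ala"})

for label, segment in (("Cys64/Ala65", cys64_ala65), ("Met64/Ala65", met64_ala65)):
    for position, ref, alt in differences(WILD_TYPE, segment):
        print(f"{label}: {ref}{position} -> {alt}{position}")
```

Within this nine-residue window the only differences reported are at positions 64 and 65, which are the positions recited in claims 10 to 16.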

FIG. 6

FIG. 7

FIG. 8

FIG. 9